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NOVEL BT TOXIN RECEPTORS FROM LEPIDOPTERAN INSECTS AND 

METHODS OF USE " 

FIELD OF THE INVENTION 

The field of the invention is manipulating Bt toxin susceptibility in pliant pests. 
The field of the invention relates to the isolation and characterization of nucleic acid and 
polypeptides for a novel Bt toxin receptor. The nucleic acid and polypeptides are useful 
in developing new insecticides. 

5 

BACKGROUND OF THE INVENTION 
Traditionally, growers used chemical pesticides as a means to control 
agronomically important pests. The introduction of transgenic plants carrying the 
delta-endotoxin from Bacillus thuringiensis (Bt) afforded a non-chemical method of 

10 control. Bt toxins have traditionally been categorized by their specific toxicity 

towards specific insect categories. For example, the Cryl group of toxins are toxic to 
Lepidoptera The Cryl group includes, but is not limited to, CrylA(a), CrylA(b) and 
CrylA(c). See Hofle et al (1989) Microbiol Rev 53: 242-255. 

Lepidopteran insects cause considerable damage to maize crops throughout 

15 North America and the world. One of the leading pests is Ostrinia nubilalis, 
commonly called the European Corn Borer (ECB). Genes encoding the crystal 
proteins CrylA(b) and CrylA(c) from Bt have been introduced into maize as a means 
of ECB control. These transgenic maize hybrids have been effective in control of 
ECB. However, developed resistance to Bt toxins presents a challenge in pest 

20 control. See McGaughey et al (1998) Nature Biotechnology 16: 144-146; Estruch et 
al (1997) Nature Biotechnology 75:137-141; Roush et al (\991)Nature 
Biotechnology 15 8 1 6-8 1 7; and Hofte et al ( 1 989) Microbiol Rev S3: 242-255. 

The primary site of action of Cryl toxins is in the brush border membranes of 
the midgut epithelia of susceptible insect larvae such as lepidopteran insects. Cryl A 

25 toxin binding polypeptides have been characterized from a variety of Lepidopteran 
species. A CrylA(c) binding polypeptide with homology to an aminopeptidase N has 
been reported from Manduca sexia y Lymantria dispar, Helicoverpa zea and Heliothis 
virescens . See Knight et al (1994) Mol Micro 11: 429-436; Lee et al (1996) Appl 
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Environ Micro 63: 2845-2849; Gill et al (1995) J Biol Chem 270: 27277-27282; and 
Garczynski et al. (1 99 1 ) Appl Environ Microbiol 10: 28 1 6-2820. 

Another Bt toxin binding polypeptide (BTR1) cloned from M. sexta has 
homology to the cadherin polypeptide superfamily and binds CrylA(a), CrylA(b) and 
5 CrylA(c). See Vadlamudi et al (1995) J Biol Chem 270(1 ^:5490-4, Keeton et al 
(1998) Appl Environ Microbiol 5^:2158-2165; Keeton et al (1997) Appl Environ 
Microbiol 63(9) :34 19-3425 and U.S. Patent Patent No: 5,693,491 . 

A subsequently cloned homologue to BTR1 demonstrated binding to CrylA(a) 
from Bombyx mori as described in Ihara et al (1 998) Comparative Biochemistry and 
10 Physiology, Part B 720:197-204 and Nagamatsu et al (1998) Biosci. Biotechnol 
Biochem.62(4):727-734. " 

Identification of the plant pest binding polypeptides for Bt toxins are useful for 
investigating Bt toxin-2?/ toxin receptor interactions, selecting and designing improved 
toxins, developing novel insecticides, and new Bt toxin resistance management 
15 strategies. 

SUMMARY OF THE INVENTION 
Compositions and methods for modulating susceptibility of a cell to Bt toxins 
are provided. The compositions include Bt toxin receptor polypeptides, and 
20 fragments and variants thereof, from the lepidopteran insects European com 
borer(EGB, Ostrinia nubilalis), corn earworm (GEW, Heliothis Zed), and fall 
armyworm (FAW, Spodopterafrugiperda). The polypeptides bind CrylA toxins, 
more particularly Cry 1 A(b). Nucleic acids encoding the polypeptides, antibodies 
specific to the polypeptides, as well as nucleic acid constructs for expressing the 
25 polypeptides in cells of interest are also provided. 

The methods are useful for investigating the structure-function relationships of 
ifrtoxin receptors; investigating the toxin-receptor interactions; elucidating the mode 
of action of Bt toxins; screening and identifying novel Bt toxin receptor ligands 
including novel insecticidal toxins; ai>d designing and developing novel Bt toxin 
30 receptor ligands. - r: ;o 

The methods are useful for ^ managing J?/ toxin resistance in pl^t pests, m 
protecting plants against damage by plant pests,; 

2 
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BRIEF DESCRIPTION OF THE DRAWINGS 
Figure 1 schematically depicts the location of the signal sequence, putative glycosilation 
sites, cadherin-like domains, transmembrane segment, CrylA bindingTegion and protein 
kinase C phosphory lation site of the Bt toxin receptor from Ostrinia nubilalis; the 
5 nucleotide sequence of the receptor set forth in SEQ ID NO: 1 and the corresponding 
deduced amino acid sequence in SEQ ID NO:2. 

DETAILED DESCRIPTION OF THE INVENTION 

1 0 The invention is directed to novel receptor polypeptides that bind Bt toxin, the 

receptor being derived from the order lepidoptera. The receptors of the invention 
include those receptor polypeptides that bind Bt toxin and are derived from the 
lepidopteran superfamily Pyraloidea and particularly from the species Ostrinia, 
specifically Ostrinia nubilalis; those derived from Spodoptera frugiperda (& 

15 frugiperda); and those derived from Heliothus Zea (H. Zed). The polypeptides have 
homology to members of the cadherin superfamily of proteins. 

Accordingly, compositions of the invention include isolated polypeptides that 
are involved in Bt toxin binding. In particular, the present invention provides for 
isolated nucleic acid molecules comprising nucleotide sequences encoding the amino 

20 acid sequences shown in SEQ ID NOs: 2, 4, and 6; or the nucleotide sequences 
having the DNA sequences deposited in a plasmid in a bacterial host as Patent 
Deposit No. PTA-278, PTA-1 760, and PTA-2222. Further provided are polypeptides 
having an amino acid sequence encoded by a nucleic acid molecule described herein, 
for example those set forth in SEQ ID NOs: 1, 3, and 5; those deposited in a plasmid 
-25 in a bacterial host as Patent Deposit Nos. PTA-278, PTA-1760, and PTA-2222; and 
fragments and variants thereof 

Plasmids containing the nucleotide sequences of the invention were deposited 
with the Patent Depository of the American Type Culture Collection (ATCC), 
Manassas, Virginia on June 25, 1999; April 25, 2000; and July 11, 2000; and assigned 

30 Patent Deposit Nos. PTA-278, PTA^1760, and PTA-2222. These deposits will be 

maintained under the terms of the Budapest Treaty on the International Recognition of 
the Deposit of Microorganisms for the Purposes of Patent Procedure: These deposits 

- - • ■ - " 3 - 
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were made merely as a convenience for those of skill in the art and are not an 
admission that a deposit is required under 35 UlS.C. §112. 

The term "nucleic acid" refers to all forms of DNA such as cDNA or genomic 
DNA and RNA such as mRNA, as well as analogs of the DNA or RN A generated using 
5 nucleotide analogs. The nucleic acid molecules can be single stranded or double 
stranded. Strands can include the coding or non-coding strand. 

The invention encompasses isolated or substantially purified nucleic acid or 
polypeptide compositions. An "isolated*' or "purified" nucleic acid molecule or 
polypeptide, or biologically active portion thereof, is substantially free of other 

10 cellular material, or culture medium when produced by recombinant techniques, or 
substantially free of chemical precursors or other chemicals when chemically 
synthesized. Preferably, an "isolated" nucleic acid is free of sequences (preferably 
polypeptide encoding sequences) that naturally flank the nucleic acid (i.e., sequences 
located at the 5* and 3* ends of the nucleic acid) in the genomic DNA of the organism 

1 5 from which the nucleic acid is derived. For example, in various embodiments, the 

isolated nucleic acid molecule can contain less than about 5 kb, 4 kb, 3 kb, 2 kb, 1 kb, 
0.5 kb, or 0.1 kb of nucleotide sequences that naturally flank the nucleic acid 
molecule in genomic DNA of the cell from which the nucleic acid is derived. A 
polypeptide that is substantially free of cellular material includes preparations of 

20 polypeptide having less than about 30%, 20%, 1 0%, 5%, (by dry weight) of 

contaminating polypeptide. When the polypeptide of the 1 invention or biologically 
active portion thereof is recombinantly produced, preferably culture medium 
represents less than about 3 0%, 20%, 1 0%, or 5% (by dry weight) of chemical 
precursors or non-pdlypeptide-of-interest chemicals. 

25 It is understood, however, that there are embodiments in which preparations that 

do not contain the substantially pure polypeptide may also be useful. Thus, less pure 
preparations can be useful where the contaminating material does not interfere with the 
h specific desired use of the peptide. The compositions of the invention also encompass 
fragments and variants of the disclosed nucleotide sequences and the polypeptides 

30 encoded thereby; * • «.-- 

The compositions of the invention are useful for, among other uses, expressing 
the receptor polypeptides in cells of interest to produce cellular or isolated ;i * 
preparations of the polpeptides for investigating the structure-function relationships of 

4 
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Bt toxin receptors; investigating the toxin-receptor interactions; elucidating the mode 
of action of Bt toxins; screening and identifying novel Bt toxin receptor ligands 
including novel insecticidal toxins; and designing and developing tiov&TBt toxin 
receptor ligands including novel insecticidal toxins. 
5 The isolated nucleotide sequences encoding the receptor polypeptides of the 

invention are expressed in a cell of interest; and the Bt toxin receptor polypeptides 
produced by the expression is utilized in intact cell or in-vitro receptor binding assays, 
and/or intact cell toxicity assays. Methods and conditions for Bt toxin binding and 
toxicity assays are known in the art and include but are not limited to those described 

10 in United States Patent NO: 5,693,491; T.P. Keeton et al (199S}Appl Environ. 
Microbiol. 64(6):2\5&-2l65; B.R. Francis et al. (1997) Insect Biochem. MoL Biol 
27(6):54U550; T.P. Keeton et al. (1997) Appl. Environ. Microbiol 63(9):34 19-3425; 
R.K. Vadlamudi et al (1995) J. Biol Chem. 270(1 0;:549O-5494; Ihara et al (1998) 
Comparative Biochem. Physiol B 720:197-204; Nagamatsu et al (1998) Biosci. 

1 5 Biotechnol Biochem. 62(4):727-734, herein incorporated by reference. Such methods 
could be modified by one of ordinary skill in the art to develop assays utilizing the 
polypeptides of the invention. 

By "cell of interest" is intended any cell in which expression of the polypeptides 
of the invention is desired. Cells of interest include, but are not limited to mammalian, 

20 avian, insect, plant, bacteria, fungi and yeast ceils. Cells of interest include but are not 
limited to cultured cell lines, primary cell cultures, cells in vivo 9 and cells of transgenic 
organisms. 

The methods of the invention encompass using the polypeptides encoded by the 
nucleotide sequences of the invention in receptor binding and/or toxicity assays^to 

25 screen candidate ligands and identify novel Bt toxin receptor ligands, including 
receptor agonists, and antagonists. Candidate ligands include molecules available 
from diverse libraries of small molecules created by combinatorial synthetic methods. 
Candidate ligands also include, but are not limited to antibodies, peptides, and other 
small molecules designed or deduced to interact with the receptor polypeptides of the 

30 invention. Candidate ligands include but are not limited to peptide fragments of the 
receptor, anti-receptor antibodies, antiidiotypie antibodies mimicking one or more 
receptor binding domains of a toxin, fusion proteins produced by combining two or 
more toxins or fragments thereof, and the like. Ligands identified by the screening 

5 
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methods of the invention include potential novel insecticidal toxins, the insecticidal 
activity of which can be determined by known methods; for example, as described in 
U.S. Patent No: 5,407,454; U.S. Application NO: 09/218,942; U.S. Apphcation No: 
09/003,217. 

5 The invention provides methods for screening for ligands that bind to the 

polypeptides described herein. Both the polypeptides and relevant fragments thereof (for 
example, the toxin binding domain) can be used to screen by assay for compourids that 
bind to the receptor and exhibit desired binding characteristics. Desired binding 
characteristics include, but are not limited to binding affinity, binding site specificity, 

10 association and dissociation rates, and the like. The screening assays could be intact cell 
or in vitro assays which include exposing a ligand binding domain to a sample ligand 
and detecting the formation of a ligand-binding polypeptide complex. The assays could 
be direct ligand-receptor binding assays or ligand competition assays. 

In one embodiment, the methods comprise providing at least one Bt toxin 

1 5 receptor polypeptide of the invention, contacting the polypeptide with a sample and a 
control ligand under conditions promoting binding; and determining binding 
characteristics of sample ligands, relative to control ligands. The methods encompass 
any method known to the skilled artisan which can be used to provide the 
polypeptides of the invention in a binding assay. For in vitro binding assays, the 

20 polypeptide may be provided as isolated, lysed, or homogenized cellular preparations. 
Isolated polypeptides may be provided in solution, or immobilized to a matrix. 
Methods for immobilizing polypeptides are well known in the art, and include but are 
_ not limited to construction and use of fusion polypeptides with commercially 
available high affinity ligands. For example, GST fusion proteins can be adsorbed 

25i> onto glutathione sepharose beads (Sigma Chemical, St. Louis, MO) or glutathione 

derivatized microtitre plates. The polypeptides can also be immobilized utilizing well 
techniques in the art utilizing conjugation of biotin and streptavidin. The ? 
polypeptides can also be immobilized utilizing well known techniques in the art 
utilizing chemical conjugation (linking) of polypeptides to a matrix. Alternatively,* the 

30 polypeptides may be provided in intact cell binding assays in which the polypeptides 
are generally expressed as cell surface Bt toxin receptors. 

" The invention provides methods utilizing intact cell toxicity assays to screen for 
ligands that bind to the receptor polypeptides described herein and confer toxicity upon a 

6 
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cell of interest expressing the polypeptide. A ligand selected by this screening is a 
potential insecticidal toxin to insects expressing the receptor polypeptides, particularly 
enterally. This deduction is premised on theories that insect specificity ofVparticular Bt 
toxin is determined by the presence of the receptor in specific insect species, or that 
5 binding of the toxins is specific for the receptor of some insect species and is bind is 
insignificant or nonspecific for other variant receptors. See, for example Hofte et al 
(1 989) Microbiol Rev 53 : 242-255. The toxicity assays include exposing, in intact cells 
expressing a polypeptide of the invention, the toxin binding domain of the polypeptide to 
a sample ligand and detecting the toxicity effected in the cell expressing the polypeptide. 
10 By "toxicity" is intended the decreased viability of a cell. By "viability" is intended the 
ability of a cell to proliferate and/or differentiate and/or maintain its biological 
characteristics in a manner characteristic of that cell in the absence of a particular 
cytotoxic agent. 

In one embodiment, the methods of the present invention comprise providing 
15 at least one cell surface Bt toxin receptor polypeptide of the invention comprising an 
extracellular toxin binding domain, contacting the polypeptide with a sample and a 
control ligand under conditions promoting binding, and determining the viability of 
the cell expressing the cell surface Bt toxin receptor polypeptide, relative to the 
control ligand. 

20 By "contacting" is intended that the sample and control agents are presented to 

the intended ligand binding site of the polypeptides of the invention. 

By "conditions promoting binding" is intended any combination of physical 
and biochemical conditions that enables a ligand of the polypeptides of the invention 
to determinably bind the intended polypeptide over background levels. Examples of 

25 such conditions for binding of Cry 1 toxins to Bt toxin receptors, as well as methods 
for assessing the binding, are known in the art and include but are not limited to those 
described in Keeton et al (1998) Appl Environ Microbiol 64(6): 2158-2165; Francis 
etal (1 997) Insect Biochem Mol Biol 27^:541-550; Keeton et al (1 997) Appl 
Environ Microbiol ' 63^:3419^3425; VacUa^ 

30 270(1 0)\5A9Q^5A9A\ Ihara e/ai (1998) Comparative Biochemistry arid Physiology, 
Part B 720:197-204; and Nagamatsu et al ( 1 99S) Biosct Biotechnol Biochem. . 
62^:727-734, the contents of which are herein incorporated by reference. In this 
aspect of the present invention, known and commercially available methods fori : - , 

7 
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studying protein-protein interactions, such as yeast and/or bacterial two-hybrid 
systems could also be used. Two-hybrid systems are available from, for example, 
CLONTECH (Palo Alto, Ca) or Display Systems Biotech Inc. (Vista, Ca)7 

The compositions and screening methods of the invention are useful for 
5 designing and developing novel Bt toxin receptor ligands including novel insecticidal 
toxins. Various candidate ligands; ligands screened and characterized for binding, 
toxicity, and species specificity; and/or ligands having known characteristics and 
specificities, could be linked or modified to produce novel ligands having particularly 
desired characteristics and specificities. The methods described herein for assessing 
10 binding, toxicity and insecticidal activity could be used to screen and characterize the 
novel ligands. 

In one embodiment of the present invention, the sequences encoding the 
receptors of the invention, and variants and fragments thereof, are used with yeast and 
bacterial two-hybrid systems to screen for Bt toxins of interest (for example, more 

15 specific and/or more potent toxins), or for insect molecules that bind the receptor and 
can be used in developing novel insecticides. 

By "linked" is intended that a covalent bond is produced between two or more 
molecules. Known methods that can be used for modification and/or linking of 
polypeptide ligands such as toxins, include but are not limited to mutagenic and 

20 recombinogenic approaches including but not limited to site-directed mutagenesis, 

chimeric polypeptide construction and DNA shuffling. Such methods are described in 
further detail below. Known polypeptide modification methods also include methods 
for covalent modification of polypeptides. "Operably linked" means that the linked 
molecules carry out the function intended by the linkage. 

25 The compositions and screening methods of the present invention are useful 

for targeting ligands to cells expressing the receptor polypeptides of the invention. 
For targeting, secondary polyeptides, and/or small molecules which do not bind the 
receptor polypeptides of the invention are linked with one or more primary ligands 
which bind the receptor polypeptides; including but not limited to Cry 1 A toxSn; more 

30 particularly Cry 1 A(b) toxin or a fragment thereof. By this linkage, any polypeptide 
and/or small molecule linked to a primary ligand could be targeted to the receptor 
polypeptide, and thereby to a cell expressing the receptor polypeptide; wherein the 
ligand binding site is available at the extracellular surface of the cell. 
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In one embodiment of the invention, at least one secondary polypeptide toxin 
is linked with a primary Cryl A toxin capable of binding the receptor polypeptides of 
the invention to produce a combination toxin which is targeted and iotfie^ to insects 
expressing the receptor for the primary toxin. Such insects include those of the order 
5 lepidoptera, superfamily Pyraloidea and particularly from the species Ostrinia, 

specifically Ostriniamibilalis. Such insects include the lepidopterans S.frugiperda 
and H. Zea. Such a combination toxin is particularly useful for eradicating or 
reducing crop damage by insects which have developed resistance to the primary 
toxin. 

1 0 For expression of the Bt toxin receptor polypeptides of the invention in a cell 

of interest, the Bt toxin receptor sequences are provided in expression cassettes. The 
cassette will include 5' and 3' regulatoiy sequences operably linked to a Bt toxin 
receptor sequence of the invention. In this aspect of the present invention, by 
"operably linked" is intended a functional linkage between a promoter and a second 

1 5 sequence, wherein the promoter sequence initiates and mediates transcription of the 
DNA sequence corresponding to the second sequence. In reference to nucleic acids, 
generally, operably linked means that the nucleic acid sequences being linked are 
contiguous and, where necessary to join two polypeptide coding regions^ contiguous 
and in the same reading frame. The cassette may additionally contain at least one 

20 additional gene to be cotransformed into the organism. Alternatively, the additional 
gene(s) can be provided on multiple expression cassettes. 

Such an expression cassette is provided with a plurality of restriction sites for 
insertion of the Bt toxin receptor sequence to be under the transcriptional regulation of 
the regulatory regions. The expression cassette may additionally contain selectable 

25 marker genes. 

The expression cassette will include in the 5-3* direction of transcription, a 
transcriptional and translational initiation region, a Bt toxin receptor nucleotide 
sequence of the invention, and a transcriptional and translational termination region 
functional in host cells. The transcriptional initiation region, the promoter, may be 

30 native or analogous, or foreign or heterologous to the plant host Additionally, the 
promoter may be the natural sequence or alternatively a synthetic sequence. By 
"foreign" is intended that the transcriptional initiation region is not found in the native 
host cells into which the transcriptional initiation region is introduced. As used 
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herein, a chimeric gene comprises a coding sequence operably linked to a 
transcription initiation region that is heterologous to the coding sequence. 

While it may be preferable to express the sequences using heterologous 
promoters, the native promoter sequences may be used. Such constructs would 
5 change expression levels of Bt toxin receptor in the cell of interest. Thus; the 
phenotype of the cell is altered. 

The termination region may be native with the transcriptional initiation region, 
may be native with the operably linked DNA sequence of interest, or may be derived 
from another source. 

1 -0 Where appropriate, the gene(s) may be optimized for increased expression in a 

particular transformed cell of interest. That is, the genes can be synthesized using 

host cell-preferred codons for improved expression. 

Additional sequence modifications are known to enhance gene expression in a 

cellular host. These include elimination of sequences encoding spurious 
1 5 polyadenylation signals, exon-intron splice site signals, transposon-like repeats, and 

other such well-characterized sequences that may be deleterious to gene expression. 

The G-C content of the sequence may be adjusted to levels average for a given 

cellular host, as calculated by reference to known genes expressed in the host cell. 

When possible, the sequence is modified to avoid predicted hairpin secondary mRNA 
20 structures. 

The expression cassettes may additionally contain 5' leader sequences in the 
expression cassette construct. Such leader sequences can act to enhance translation. 
Translation leaders are known in the art and include: picornavirus leaders, for 
example, EMCV leader (Encephalomyocarditis 5' noncoding region) (Elroy-Stein et 

25 al (1989) PNAS USA £6:6126-6130); potyvirus leaders, for example, TEV leader 
(Tobacco Etch Virus) (Allison et al (1986); MDMV leader (Maize Dwarf Mosaic 
Virus); Virology 154:9-20), and human immunoglobulin heavy-chain binding 
polypeptide (BiP), (Macejak e/ al (1991) Nature 555:90-94); untranslated leader from 
the coat polypeptide mRNA of alfalfa mosaic virus (AMV RNA 4) (Jobling et al 

30 (1987) Nature 525:622-625); tobacco mosaic virus leader (TMV) (Gallie e/ al (1989) 
in Molecular Biology of RNA, ed. Cech (Liss, New York), pp. 237-256); and maize 
chlorotic mottle virus leader (MCMV) (Lommel et al (1991) Virology 57:382-385). 

10 
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See also, Della-Cioppa et al (1987) Plant Physiol £4:965-968. Other methods 
known to enhance translation can also be utilized, for example, introns, and the like. 

In preparing the expression cassette, the various DN A fragments-may be 
manipulated, so as to provide for the DNA sequences in the proper orientation and, as 

5 appropriate, in the proper reading frame. Toward this end, adapters or linkers may be 
employed to join the DNA fragments or other manipulations may be involved to 
provide for convenient restriction sites, removal of superfluous DNA, removal of 
restriction sites, or the like. For this purpose, in vitro mutagenesis, primer repair, 
restriction, annealing, resubstitutions, e.g., transitions and transversions, may be 

10 involved. 

Using the nucleic acids of the present invention, the polypeptides of the 
invention could be expressed in any cell of interest, the particular choice of the cell 
depending on factors such as the level of expression and/or receptor activity desired. 
Cells of interest include, but are not limited to conveniently available mammalian, plant, 
1 5 insect, bacteria, and yeast host cells. The choice of promoter, terminator, and other 

expression vector components will also depend on the cell chosen. The cells produce 
the protein in a non-natural condition (e.g., in quantity, composition, location, and/or 
time), because they have been genetically altered through human intervention to do 
so. 

20 It is expected that those of skill in the art are knowledgeable in the numerous 

expression systems available for expression of a nucleic acid encoding a protein of the 
present invention. No attempt to describe in detail the various methods known for the 
expression of proteins in prokaryotes or eukaryotes will be made. 

In brief summary, the expression of isolated nucleic acids encoding a protein 

25 of the present invention will typically be achieved by operably linking, for example, 
the DNA or cDNA to a promoter, followed by incorporation into an expression 
vector. The vectors can be suitable for replication and integration in either 
prokaryotes or eukaryotes. Typical expression vectors contain transcription and 
translation terminators, initiation sequences^ and promoters useful for regulation of 

30 the expression of the DNA encoding a protein of the present invention. To obtain 
high level expression of a cloned gene, it is desirable to construct expression vectors 
which contain, at the minimum, a strong promoter to direct transcription, a ribosome 
binding site for translational initiation, and a transcription/translation terminator. One 

11 
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of skill would recognize that modifications can be made to a protein of the present 
invention without diminishing its biological activity. Some modifications may be 
made to facilitate the cloning, expression, or incorporation of the targeting molecule 
into a fusion protein. Such modifications are well known to those of skill in the art 

5 and include, for example, a methionine added at the amino terminus to provide an 
initiation site, or additional amino acids (e.g., poly His) placed on either terminus to 
create conveniently located restriction sites or termination codons or purification 
sequences. 

Prokaryotic cells may be used as hosts for expression. Prokaryotes most 

10 frequently are represented by various strains of E. coli; however, other microbial 
strains may also be used. Commonly used prokaryotic control sequences which are 
defined herein to include promoters for transcription initiation, optionally with an 
operator, along with ribosome binding site sequences, include such commonly used 
promoters as the beta lactamase (penicillinase) and lactose (lac) promoter systems 

1 5 (Chang et aL (1 977) Nature J 98: 1056), the tryptophan (trp) promoter system 
(Goeddel et aL ( 1 980) Nucleic Acids Res. 5:4057) and the lambda-derived P L 
promoter and N-gene ribosome binding site (Shimatake et aL (1981) Nature 292:12%). 
The inclusion of selection markers in DNA vectors transfected in E. coli is also 
useful. Examples of such markers include genes specifying resistance to ampicillin, 

20 tetracycline, or chloramphenicol. 

The vector is selected to allow introduction into the appropriate host cell. 
Bacterial vectors are typically of plasmid or phage origin. Appropriate bacterial cells 
are infected with phage vector particles or transfected with naked phage vector DNA. 
If a plasmid vector is used, the bacterial cells are transfected with the plasmid vector 

25 DNA. Expression systems for expressing a protein of the present invention are 
available using Bacillus sp. and Salmonella (Palva et aL < 1 983) Gene 22:229-235; 
Mosbach et aL (1983) Nature 502:543-545). 

A variety of eukaryotic expression systems such as yeast, insect cell lines, 
plant and mammalian cells, are known to those of skill in the art. The sequences of 

30 the present invention can be expressed in these eiikary otic systems. In some 

embodiments, trarisformed/transfected plant cells are employed as expression systems 
for production of the proteins of the instant invention. 

12 
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Synthesis of heterologous proteins in yeast is well known. Sherman, F. et aL 
(1982) Methods in Yeast Genetics, Cold Spring Harbor Laboratory is a well 
recognized work describing the various methods available to produce tfef protein in 
yeast. Two widely utilized yeast for production of eukaryotic proteins are 
5 Saccharomyces cerevisia and Pichia pastor is. Vectors, strains, and protocols for 
expression in Saccharomyces and Pichia are known in the art and available from 
commercial suppliers (e.g., Invitrogen). Suitable vectors usually have expression 
control sequences, such as promoters, including 3 -phosphogly cerate kinase or alcohol 
oxidase, and an origin of replication, termination sequences and the like as desired. 
10 A protein of the present invention, once expressed, can be isolated from yeast 

by lysing the cells and applying standard protein isolation techniques to the ly sates. 
The monitoring of the purification process can be accomplished by using Western blot 
techniques or radioimmunoassay or other standard immunoassay techniques. 

The sequences encoding proteins of the present invention can also be ligated 
1 5 to various expression vectors for use in transfecting cell cultures of, for instance, 
mammalian, insect, or plant origin. Illustrative of cell cultures useful for the 
production of the peptides are mammalian cells. Mammalian cell systems often will 
be in the form of monolayers of cells although mammalian cell suspensions may also 
be used. A number of suitable host cell lines capable of expressing intact proteins 
20 have been developed in the art, and include the COS, HEK293, BHK2 1 ,. and CHO 
cell lines. Expression vectors for these cells can include expression control 
sequences, such as an origin of replication, a promoter (e.g., the CMV promoter, a 
HSV tk promoter or pgk (phosphoglycerate kinase promoter)), an enhancer (Queen et 
aL (1986) Immunol Rev. 59:49), and necessary processing information sites, such as 
25 ribosome binding sites, RNA splice sites, polyadenylation sites (e.g., an SV40 large T 
Ag poly A addition site), and transcriptional terminator sequences. Other animal cells 
useful for production of proteins of the present invention are available, for instance, 
from the American Type Culture Collection Catalogue of Cell Lines and Hybridomas 
(7th edition, 1992). A particular example of mammalian cells for expression of a Bt 
30 toxin receptor and assessing Bt toxin cytotoxicity mediated by the receptor, includes 
embryonic 293 cells. See U.S. Patent NO. 5,693,491, herein incorporated by 
reference. •• -'V 

. 13 . . . 
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Appropriate vectors for expressing proteins of the present invention in insect 
cells are usually derived from the SF9 baculovirus. Suitable insect cell lines include 
mosquito larvae, silkworm, armyworm, moth and Drosophila cell lines such as a 
Schneider cell line (See Schneider et ah (1987) J. Embryol Exp. Morphol 27: 

5 353-365). 

As with yeast, when higher animal or plant host cells are employed, 
polyadenylation or transcription terminator sequences are typically incorporated into 
the vector. An example of a terminator sequence is the polyadenylation sequence 
from the bovine growth hormone gene. Sequences for accurate splicing of the 

10 transcript may also be included. An example of a splicing sequence is the VP1 introri 
from SV40 (Sprague et at. (1 983) J. Virol 45:773-781). Additionally, gene 
sequences to control replication in the host cell may be incorporated into the vector 
such as those found in bovine papilloma virus-type vectors. Saveria-Campo, M., 
Bovine Papilloma Virus DNA a Eukaryotic Cloning Vector in DNA Cloning Vol II a 

1 5 Practical Approach, D.M. Glover, ed., IRL Pres, Arlington, Virginia pp. 21 3-238 
(1985). 

In a particular embodiment of the invention, it may be desirable to negatively 
coritrol receptor binding; particularly, when toxicity to a cell is no longer desired or if 
it is desired to reduce toxicity to a lower level. In this case, ligand-receptor 

20 polypeptide binding assays can be used to screen for compounds which bind to the 
receptor but do not confer toxicity to a cell expressing the receptor. The examples of 
a molecule that can be used to block ligand binding include an antibody that 
specifically recognizes the ligand binding domain of the receptor such that ligand 
binding is decreased or prevented as desired. 

25 In another embodiment, receptor polypeptide expression could be blocked by the 

use of antisense molecules directed against receptor RNA or ribozymes specifically 
targeted to this receptor RNA. It is recognized that with the provided nucleotide 
sequences, antisense constructions, complementary to at least a portion of the 
messenger RNA (mRNA) for the Bt toxin receptor sequences can be constructed. 

30 Antisense nucleotides are constructed to hybridize with the corresponding mRNA. 
Modifications of the antisense sequences may be made as long as the sequences 
hybridize to and interfere with expression of the corresponding mRNA. In this 
manner, antisense constructions having 70%, preferably 80%, more preferably 85% 
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sequence similarityto the corresponding antisensed sequences may be used. 
Furthermore, portions of the antisense nucleotides may be used to disrupt the 
expression of the target gene. Generally, sequences of at least 50 nucteotides, 100 
nucleotides, 200 nucleotides, or greater may be used. 
5 Fragments and variants of the disclosed nucleotide sequences and 

polypeptides encoded thereby are encompassed by the present invention. By 
"fragment" is intended a portion of the nucleotide sequence, or a portion of the amino 
acid sequence, and hence a portion of the polypeptide encoded thereby. Fragments of 
a nucleotide sequence may encode polypeptide fragments that retain the biological 
1 0 activity of the native polypeptide and, for example, bind Bt toxins. Alternatively, 
fragments of a nucleotide sequence that are useful as hybridization probes generally 
do not encode fragment polypeptides retaining biological activity. Thus, fragments of 
a nucleotide sequence may range from at least about 20 nucleotides, about 50 
nucleotides, about 100 nucleotides, and up to the full-length nucleotide sequence 
15 encoding the polypeptides of the invention. 

A fragment of a Bt toxin receptor nucleotide sequence that encodes a 
biologically active portion of a Bt toxin receptor polypeptide of the invention will 
encode at least 15, 25, 30, 50, 100, 150, 200 or 250 contiguous amino acids, or up to 
the total number of amino acids present in a full-length Bt toxin receptor polypeptide 
20 of the invention (for example, 1 7 1 7, 1 730, and 1 734 amino acids for SEQ ID NOs:2, 
4, and 6, respectively. Fragments of a 5r toxin receptor nucleotide sequence that are 
useful as hybridization probes for PCR primers generally need not encode a 
biologically active portion of a Br toxin receptor polypeptide. 

Thus, a fragment of a Bt toxin receptor nucleotide sequence may encode a 
25 biologically active portion of a Bt toxin receptor polypeptide, or it may be a fragment 
that can be used as a hybridization probe or PGR primer using methods disclosed 
below. A biologically active portion of a Bt toxin receptor polypeptide can be 
prepared by isolating a portion of one of the Bt toxin receptor nucleotide sequences of 
the invention, expressing the encoded portion of the Bt toxin receptor polypeptide 
30 (e.g., by recombinant expression in vitro), and assessing the activity of the encoded 
portion of the Bt toxin receptor polypeptide. Nucleic acid molecules that are 
fragments of a Bt toxin receptor nucleotide sequence comprise at least 16, 20, 50, 75, 
100, 150, 200, 250, 300, 350, 400, 450, 500, 550, 600, 650, 700, 800, 900, 1,000, 
-- - - • •- - IS ----- - - - 
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1,100, 1,200, 1,300, or 1,400 nucleotides, or up to the number of nucleotides present 
in a full-length Bt toxin receptor nucleotide sequence disclosed herein (for example, 
5498, 5527, and 5614 nucleotides for SEQ ID NOs: 1, 3, and 5, respectively). 

By "variants" is intended substantially similar sequences. For nucleotide 
5 sequences, conservative variants include those sequences that, because of the 

degeneracy of the genetic code, encode the amino acid sequence of one of the Bt toxin 
receptor polypeptides of the invention. Naturally occurring allelic variants such as 
these can be identified with the use of well-known molecular biology techniques, as, 
for example, with polymerase chain reaction (PGR) and hybridization techniques as 

10 outlined below. Variant nucleotide sequences also include synthetically derived 
nucleotide sequences, such as those generated, for example, by using site-directed 
mutagenesis, but which still encode a Bt toxin receptor protein of the invention. 
Generally, variants of a particular nucleotide sequence of the invention will have at 
least about 40%, 50%, 60%, 65%, 70%, generally at least about 75%, 80%, 85%, 

15 preferably at least about 90%, 91%, 92%, 93%, 94%, 95%,_96%, 97%, and more 
preferably at least about 98%, 99% or more sequence identity to that particular 
nucleotide sequence as determined by sequence alignment programs described 
elsewhere herein using default parameters. 

By "variant" protein is intended a protein derived from the native protein by 

20 deletion (so-called truncation) or addition of one or more amino acids to the N- 

terminal and/or C-terminal end of the native protein; deletion or addition of one or 
more amino acids at one or more sites in the native protein; or substitution of one or 
more amino acids at one or more sites in the native protein. Variant proteins 
encompassed by the present invention are biologically active, that is they continue to 

25 possess the desired biological activity of the native protein, that is, activity as 

described herein (for example, Bt toxin binding activity). Such variants may result 
from, for example, genetic polymorphism or from human manipulation. Biologically 
active variants of a native Bt toxin receptor protein of the invention will have at least 
about 40%, 50%, 60%, 65%, 70%, generally at least about 75%* 80%, 85%, 

30 preferably at least about 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, and more 
preferably at least about 98%, 99% or more sequence identity to the amino acid 
sequence for the native protein as determined by sequence alignment programs 
described elsewhere herein using default parameters. A biologically active variant of a 

16 
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protein of the invention may differ from that protein by as few as 1-15 amino acid 
residues, as few as 1-10, such as 6-10, as few as 5, as few as 4, 3, 2, or even 1 amino 
acid residue. 

The polypeptides of the invention may be altered in various ways including 
5 amino acid substitutions, deletions, truncations, and insertions. Methods for such 
manipulations are generally known in the art. For example, amino acid sequence 
variants of the Bt toxin receptor polypeptides can be prepared by mutations in the 
DNA. Methods for mutagenesis and nucleotide sequence alterations are well known 
in the art. See, for example, Kunkel (1985) Proc. Natl Acad. Set USA #2:488-492; 

10 Kunkel et al (1987) Methods in Enzymol 75*367-382; US Patent No. 4,873,192; 
Walker and Gaastra, eds. (1983) Techniques in Molecular Biology (MacMillan 
Publishing Company, New York) and the references cited therein. Guidance as to 
appropriate amino acid substitutions that do not affect biological activity of the 
protein of interest may be found in the model of Dayhoff et ah (1978) Atlas of Protein 

1 5 Sequence and Structure (Natl. Biomed * Res. Found., Washington, DC), herein 

incorporated by reference. Conservative substitutions, such as exchanging one amino 
acid with another having similar properties, may be preferable. 

Thus, the genes and nucleotide sequences of the invention include both the 
naturally occurring sequences as well as mutant forms. Likewise, the proteins of the 

20 invention encompass both naturally occurring proteins as well as variations and 
modified' forms thereof. Such variants will continue to possess the desired toxin 
binding activity. Obviously, the mutations that will be made in the DNA encoding the 
variant must not place the sequence out of reading frame and preferably will not 
. create complementary regions that could produce secondary mRNA structure. See, 

25 EP Patent Application Publication No. 75,444. 

The deletions, insertions, and substitutions of the protein sequences 
encompassed herein are not expected to produce radical changes in th6 characteristics 
of the protein. 

For example, it is recognized that at least about 10, 20, 50, 100, 150, 200, 250, 
30 300, 350, 400, 450, 500, 550, 600, 650, 700, 750, 800, 8S0, 900, 950, and up'to 960 
amino acids may be deleted from the N-terminus of a polypeptide that has the amino 
acid sequence set forth in SEQ ID NO:2, and still retain binding function. "It is further 
recognized that it least about 10, 20, 30, 40, 50, 60, 70, 80, 90, 100, 110, aid up to 

17 
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1 1 9 amino acids may be deleted from the C-terminus of a polypeptide that has the 
amino acid sequence set forth in SEQ ID NO:2, and still retain binding function. 
Deletion variants of the invention that encompass polypeptides havrngnhese deletions. 
It is recognized that deletion variants of the invention that retain binding function 
5 encompass polypeptides having these N-terminal or C-terminal deletions, or having 
any deletion combination thereof at both the C- and the N-termini. 

However, when it is difficult to predict the exact effect of the substitution, 
deletion, or insertion in advance of doing so, one skilled in the art will appreciate that 
the effect will be evaluated by routine screening assays. That is, the activity can be 

10 evaluated by receptor binding and/or toxicity assays. See, for example, United States 
Patent NO: 5,693,49 1 ; T.P. Keeton et al (1 998) Appl Environ. Microbiol 
64(6):2 1 58-2 1 65; B.R. Francis et al (1997) Insect Biochem. Mol Biol 27(6):54U 
550; T.P. Keeton et al (1997) Appl Environ Microbiol $3(9)3419-3425; R.K. 
Vadlamudie/a/. (1995) J. Biol Chem. 270(1 ^:5490-5494; Iharaetal (1998) 

15 Comparative Biochem. Physiol B 720:197-204; Nagamatsu et al (1998) BioscL 
Biotechnol Biochem. 62(4):727-734, herein incorporated by reference. 

Variant nucleotide sequences and polypeptides also encompass sequences and 
polypeptides derived from a mutagenic and recombinogenic procedure such as DNA 
shuffling. With such a procedure, one or more different toxin receptor coding 

20 sequences can be manipulated to create a new toxin receptor, including but not limited 
to a new Bt toxin receptor^ possessing the desired properties. In this manner, libraries 
of recombinant polynucleotides are generated from a population of related sequence 
polynucleotides comprising sequence regions that have substantial sequence identity 
and can be homologously recombined in vitro or in vivo. For example, using this 

25 approach, sequence motifs encoding a domain of interest may be shuffled between the 
Bt toxin receptor gene of the invention and other known Bt toxin receptor genes to 
obtain a new gene coding for a polypeptide with an improved property of interest, 
such as an increased ligand affinity in the case of a receptor. Strategies for such DNA 
shuffling are known in the art. See,; for example, Stemmer (1 994) Proc. Natl Acad 

30 Scl USA 97:10747-10751; Stemmer (1994) Nature 570:389-391; Crameri etal 

(1997) Nature Biotech. 75:436-438; Moore et al (1997) J. Mol Biol 272:336-347; 
Zhang et al, (1997) Proc. Natl Acad Scl USA P¥:4504-4509; Crameri et al (1998) 
iSfa/ureiPy^SS^n and U S.jPatent Nos. 5,605,793 and 5,837,448. , 

18 
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Where the receptor polypeptides of the invention are expressed in a cell and 
associated with the cell membrane (for example, by a transmembrane segment), in order 
for the receptor of the invention to bind a desired ligand, for examples Cry! A toxin, 
the receptor's ligand binding domain must be available to the ligand. In this aspect, it is 

5 recognized that the native Bt toxin receptor of the invention is oriented such that the 
toxin binding site is available extracellularly. 

Accordingly, in methods comprising use of intact cells, the invention provides 
cell surface Bt-toxin receptors. By a "cell surface Bt toxin receptor" is intended a 
membrane-bound receptor polypeptide comprising at least one extracellular Bt toxin 

10 binding site. A cell surface receptor of the invention comprises an appropriate 

combination of signal sequences and transmembrane segments for guiding and retaining 
the receptor at the cell membrane such that that toxin binding site is available 
extracellularly. Where native Bt toxin receptors are used for expression, deduction of the 
composition and configuration of the signal sequences and transmembrane segments is 

1 5 not necessary to ensure the appropriate topology of the polypeptide for displaying the 
toxin binding site extracellularly. As an alternative to native signal and transmembrane 
sequences, heterologous signal and transmembrane sequences could be utilized to 
produce a cell surface receptor polypeptide of the invention. 

It is recognized that it may be of interest to generate Bt toxin receptors that are 

20 capable of interacting with the receptor's ligands intracellularly in the cytoplasm, in the 
nucleus or other organelles, in other subcellular spaces; or in the extracellular space. 
Accordingly, the invention encompasses variants of the receptors of the invention, 
wherein one or more of the segments of the receptor polypeptide is modified to target the 
polypeptide to a desired intra- or extracellular location. 

25 Also encompassed by the invention are receptor fragments and variants that are 

useful, among other things, as binding antagonists that will compete with a cell surface 
receptor of the invention. Such a fragment or variant can, for example, bind a toxin but 
not be able to confer toxicity to a particular cell. In this aspect, the invention provides 
secreted receptors, more particularly secreted Bt toxin receptors; or receptors that are not 

30 membrane bound. The secreted receptors of the invention can contain a heterologous or 
homologous signal sequence facilitating its secretion from the cell expressing the 
receptors; and further comprise a secretion variation in the region corresponding to 
transmembrane segments. By "secretion variation" is intended that amino acids 
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corresponding to a tranmembrne segment of a membrane bound receptor comprise one 
or more deletions, substitutions, insertions, or any combination thereof; such that the 
region no longer retains the requisite hydrophobicity to serve as a transmembrane 
segment. Sequence alterations to create a secretion variation can be tested by 
5 confirming secretion of the polypeptide comprising the variation from the cell 
expressing the polypeptide. 

. The polypeptides of the invention can be purified from cells that naturally 
express it, purified from cells that have been altered to express it (i. e. recombinant) or 
synthesized using polypeptide synthesis techniques that are well known in the art. In one 

1 0 embodiment, the polypeptide is produced by recombinant DNA methods. In such 

methods a nucleic acid molecule encoding the polypeptide is cloned into an expression 
vector as described more fully herein and expressed in an appropriate host cell according 
to known methods in the art. The polypeptide is then isolated from cells using 
polypeptide purification techniques well known to those of ordinary skill in the art. 

1 5 Alternatively, the polypeptide or fragment can be synthesized using peptide synthesis 
methods well known to those of ordinary skill in the art. 

The invention also encompasses fusion polypeptides in which one or more 
polypeptides of the invention are fused with at least one polypeptide of interest. In one 
embodiment, the invention encompasses fusion polypeptides in which a heterologous 

20 polypeptide of interest has an amino acid sequence that is not substantially homologous 

to the polypeptide of the invention. In this embodiment, the polypeptide of the invention 

i .... . - 

and the polypeptide of interest may or may not be operati vely linked. An example of 
operative linkage is fusion in-frame so that a single polypeptide is produced upon 
translation. Such fusion polypeptides can, for example, facilitate the purification of a 

25 recombinant polypeptide. 

In another embodiment, the fused polypeptide of interest may contain a 
heterologous signal sequence at the N-terminus facilitating its secretion from specific 
host cells. The expression and secretion of the polypeptide can thereby be increased by 
use of the heterologous signal sequence. 

30 The invention is also directed to polypeptides in which one or more domains in 

the polypeptide described herein are operatively linked to heterologous domains having 
homologous functions. Thus, the toxin binding domain can be replaced with a toxin 
binding domain for other toxins. Thereby, the; toxin specificity of the receptor is based 

20 
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on a toxin binding domain other than the domain encoded by Bt toxin receptor but other 
characteristics of the polypeptide, for example, membrane localization and topology is 
based on Bt toxin receptor. ~~ 

Alternatively, the native Bt toxin binding domain may be retained while 
5 additional heterologous ligand binding domains, including but not limited to 

heterologous toxin binding domains are comprised by the receptor. Thus, the invention 
also encompasses fusion polypeptides in which a polypeptide of interest is a 
heterologous polypeptide comprising a heterologous toxin binding domains. Examples 
of heterologous polypeptides comprising Cry 1 toxin binding domains include, but are 
] 0 not limited to Knight et al (1 994) Mol Micro 11: 429-436; Lee et al (1996) Appl 
Environ Micro 63: 2845-2849; Gill et al (1995) J Biol Chem 270: 27277-27282; 
Garczynski et al (1991) Appl Environ Microbiol 10: 2816-2820; Vadlamudi et al 
(1995) J Biol Chem 270(10): 5490-4, U.S. Patent No5,693>91 . 

The Bt toxin receptor peptide of the invention may also be fused with other 
1 5 members of the cadherin superfamily. Such fusion polypeptides could provide an 

important reflection of the binding properties of the members of the superfamily. Such 
combinations could be further used to extend the range of applicability of these 
molecules in a wide range of systems or species that might not otherwise be amenable to 
native or relatively homologous polypeptides. The fusion constructs could be substituted 
20 into systems in which a native construct would not be functional because of species 
specific constraints. Hybrid constructs may further exhibit desirable or unusual 
characteristics otherwise unavailable with the combinations of native polypeptides. 

Polypeptide variants encompassed by the present invention include those that 
contain mutations that either enhance or decrease one or more domain functions. For 
25 example, in the toxin binding domain, a mutation may be introduced that increases or 
decreases the sensitivity of the domain to a specific toxin. 

As an alternative to the introduction of mutations* increase in function may be 
provided by increasing the copy number of ligand binding domains. Thus, the invention 
also encompasses receptor polypeptides in which the toxin binding domain is provided 
30 in more than one copy. 

The invention further encompasses cells containing receptor expression vectors 
comprising the Bt toxin receptor sequences, and fragments and variants thereof. The 
expression vector can contain one or more expression cassettes used to transform a cell 

21 
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of interest Transcription of these genes can be placed under the control of a constitutive 
or inducible promoter (for example, tissue - or cell cycle-preferred). 

Where more than one expression cassette utilized, the cassette tharts additional 
to the cassette comprising at least one receptor sequence of the invention, can comprise 
5 either a receptor sequence of the invention or any other desired sequences. 

The nucleotide sequences of the invention can be used to isolate homologous 
sequences in insect species other than ostrinia, particularly other lepidopteran species, 
more particularly other Pyraloidea species. 

The following terms are used to describe the sequence relationships between 
1 0 two or more nucleic acids or polynucleotides: (a) "reference sequence*', (b) 

"comparison window", (c) "sequence identity", (d) "percentage of sequence identity", 
and (e) "substantial identity". 

(a) As used herein, "reference sequence" is a defined sequence used as a 
basis for sequence comparison. A reference sequence may be a subset or the entirety 

15 of a specified sequence; for example, as a segment of a full-length cDNA or gene 
sequence, or the complete cDNA or gene sequence. 

(b) As used herein, "comparison window" makes reference to a contiguous 
and specified segment of a polynucleotide sequence, wherein the polynucleotide 
sequence in the comparison window may comprise additions or deletions (i.e., gaps) 

20 compared to the reference sequence (which does not comprise additions or deletions) 
for optimal alignment of the two sequences. Generally, the comparison window is at 
least 20 contiguous nucleotides in length, and optionally can be 30, 40, 50, 100, or 
longer. Those of skill in the art understand that to avoid a high similarity to a 
reference sequence due to inclusion of gaps in the polynucleotide sequence a giap 

25 penalty is typically introduced and is subtracted from the number of matches. 

Methods of alignment of sequences for comparison are well known in the art. 
Thus, the determination of percent identity between any two sequences can be 1 
accomplished using a mathematical algorithm. Non-limiting (examples of such 
mathematical algorithms are the algorithm of Myers and Miller (1988) CABIOS4 A 1- 

30 17; the local homology algorithm of Smith el at. (1981) Adv. Appl Math 2:482; the 
homology alignment algorithm of Needleman ahd Wunsch (1970) J. Mol BiolA 
45:443-453; the search-for-similarity-method of Pearson and Lipmari (1988) \Proc. 
Natl. Acad. Sci. #5:2444-2448; the algorithm of Karlin and Altechul (1990) Prot. 

22 
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Natl. Acad. Sci. USA 872264, modified as in Karlin and Altschul (1 993) Proc. Natl. 
Acad. Sci. USA 90:5873-5877. 

Computer implementations of these mathematical algorithms canbe utilized 
for comparison of sequences to determine sequence identity. Such implementations 
5 include, but are not limited to: CLUSTAL in the PC/Gene program (available from 
Intelligenetics, Mountain View, California); the ALIGN program (Version 2.0); the 
ALIGN PLUS program (version 3.0, copyright 1997); and GAP, BESTFIT, BLAST, 
FASTA, and TFASTA in the Wisconsin Genetics Software Package, Version 8 
(available from Genetics Computer Group (GCG), 575 Science Drive, Madison, 
1 0 Wisconsin, USA). Alignments using these programs can be performed using the 
default parameters. The CLUSTAL program is well described by Higgins etal. 
(1988) Gene 75:237-244 (1988); Higgins et al. (1989) CABIOS 5:151-153; Corpet et 
al. (1988) Nucleic Acids Res. /<*: 1088 1-90; Huang et al (1992) CABIOS 8: 155-65; 
and Pearson et al. (1994) Meth. Mol. Biol. 2*307-331. The ALIGN and the ALIGN 
1 5 PLUS programs are based on the algorithm of Myers and Miller (1 988) supra. A 

PAM120 weight residue table, a gap length penalty of 12, and a gap penalty of 4 can 
be used with the ALIGN program when comparing amino acid sequences. The 
BL AST programs of Altschul et al (1990) J. Mol. Biol. 275:403 are based on the 
algorithm of Karlin and Altschul (1990) supra. BLAST nucleotide searches can be 
20 performed with the BLASTN program, score = 1 00, wordjength - 12, to obtain 

nucleotide sequences homologous to a nucleotide sequence encoding a protein of the 
invention. BLAST protein searches can be performed with the BLASTX program, 
score = 50, wordlength = 3, to obtain amino acid sequences homologous to a protein 
or polypeptide of the invention. To obtain gapped alignments for comparison 
25 purposes, Gapped BLAST (in BLAST 2.0) can be utilized as described in Altschul et 
al. (1997) Nucleic Acids Res. 25:3389. Alternatively, PSI-BLAST (in BLAST 2.0) 
can be used to perform an iterated search that detects distant relationships between 
molecules. See Altschul et al. (1997) supra. When utilizing BLAST, Gapped BLAST, 
PSI-BLAST, the default parameters of the respective programs (e.g., BLASTN for 
30 nucleotide sequences, BLASTX for proteins) can be used. See 

http://www.ncbi.hlm.nih.gov. Alignment may also be performed manually by 
•■inspection. ■• ■ ' ''' 
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Unless otherwise stated, sequence identity/similarity values provided herein 
refer to the value obtained using GAP Version 10 using the following parameters: % 
identity using GAP Weight of 50 and Length Weight of 3; % similarity using Gap 
Weight of 12 and Length Weight of 4, or any equivalent program. By "equivalent 
5 program" is intended any sequence comparison program that, for any two sequences 
in question, generates an alignment having identical nucleotide or amino acid residue 
matches and an identical percent sequence identity when compared to the 
corresponding alignment generated by the preferred program. 

GAP uses the algorithm of Needleman and Wunsch (1 970) J. MoL Biol 48: 
10 . 443-453, to find the alignment of two complete sequences that maximizes the number 
of matches and minimizes the number of gaps. GAP considers all possible alignments 
and gap positions and creates the alignment with the largest 

number of matched bases and the fewest gaps. It allows for the provision of a gap 
creation penalty and a gap extension penalty in units of matched bases. GAP must 

1 5 make a profit of gap creation penalty number of matches for each gap it inserts. If a 
gap extension penalty greater than zero is chosen, GAP must, in addition, make a 
profit for each gap inserted of the length of the gap times the gap extension penalty. 
Default gap creation penalty values and gap extension penalty values in Version 10 of 
the Wisconsin Genetics Software Package for protein sequences are 8 and 2, 

20 respectively. For nucleotide sequences the default gap creation penalty is 50 while 
' the default gap extension penalty is 3 . The gap creation and gap extension penalties 
can be expressed as an integer selected from the group of integers consisting of from 
0 to 200. Thus, for example, the gap creation and gap extension penalties can be 0, 1 , 
2, 3, 4, 5, 6, 7, 8, 9, 10, 15, 20, 25, 30, 35, 40, 45, 50, 55, 60, 65 or greater. 

25. ; r • /GAP presents one member of the family of best alignments. There may be 
many members of this family, but no other member has a better quality. GAP 
displays four figures of merit for alignments: Quality, Ratio, Identity, and Similarity. 
The Quality is the metric maximized in order to align the sequences. Ratio is the 
quality divided by the number of bases in the shorter segment. Percent Identity is the 

30 percent of the symbols that actually match. Percent Similarity is the percent of the 
r symbols that are similar. Symbols that are across from gaps are ignored. A similarity 
- : .} is scored whfch the scoring matrix value for a pair of symbols is greater than or equal 
/ - to 0.50, the similarity threshold. The scoring matrix used in Version 10 of the 
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Wisconsin Genetics Software Package is BLOSUM62 (see Henikoff and Henikoff 
(1989)J > roc. Natl Acad. Sci. USA 89:10915). 

(c) As used herein, "sequence identity" or "identity" in the coritext of two 
nucleic acid or polypeptide sequences makes reference to the residues in the two 
5 sequences that are the same when aligned for maximum correspondence over a 
specified comparison window. When percentage of sequence identity is used in 
reference to proteins it is recognized that residue positions which are not identical 
often differ by conservative amino acid substitutions, where amino acid residues are 
substituted for other amino acid residues with similar chemical properties (e.g., charge 
1 0 or hydrophobicity) and therefore do not change the functional properties of the 

molecule. When sequences differ in conservative substitutions, the percent sequence 
identity may be adjusted upwards to correct for the conservative nature of the 
substitution. Sequences that differ by such conservative substitutions are said to have 
"sequence similarity" or "similarity". Means for making this adjustment are well 
1 5 known to those of skill in the art. Typically this involves scoring a conservative 

substitution as a partial rather than a full mismatch, thereby increasing the percentage 
sequence identity. Thus, for example, where an identical amino acid is given a score 
of 1 and a non-conservative substitution is given a score of zero, a conservative 
substitution is given a score between zero and 1. The scoring of conservative 
20 substitutions is calculated, e ? g., as implemented in the program PC/GENE 
(Intelligenetics, Moiintain View, California). « 

(d) As used herein, "percentage of sequence identity" means the value 
determined by comparing two optimally aligned sequences over a comparison 
window, wherein the portion of the polynucleotide sequence in the comparison 

25 window may comprise additions or deletions (i.e., gaps) as compared to the reference 
sequence (which does not comprise additions or deletions) for optimal alignment of 
the two sequences. The percentage is calculated by determining the number of 
positions at which the identical nucleic acid base or amino acid residue occurs in both 
sequences to yield the number of matched positions, dividing the number of matched 

30 positions by the total number of positions in the window of comparison, and 
multiplying the result by 1 00 to yield the percentage of sequence identity. 

(e) (i) The term "substantial identity" of polynucleotide sequences means that 
a polynucleotide comprises a sequence that has at least 70% sequence identity, 

•25 
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preferably at least 80%, more preferably at least 90%, and most preferably at least 
95%, compared to a reference sequence using one of the alignment programs 
described using standard parameters. One of skill in the art will recognize that these 
values can be appropriately adjusted to determine corresponding identity of proteins 
5 encoded by two nucleotide sequences by taking into account codon degeneracy, 

amino acid similarity, reading frame positioning, and the like. Substantial identity of 
amino acid sequences for these purposes normally means sequence identity of at least 
60%, more preferably at least 70%, 80%, 90%, and most preferably at least 95%. 

Another indication that nucleotide sequences are substantially identical is if 

.10 two molecules hybridize to each other under stringent conditions. Generally, 

stringent conditions are selected to be about 5°C lower than the thermal melting point 
(T m ) for the specific sequence at a defined ionic strength and pH. However, stringent 
conditions encompass temperatures in the range of about 1°C to about 20°C lower 
than the T m , depending upon the desired degree of stringency as otherwise qualified 
1 5 herein. Nucleic acids that do not hybridize to each other under stringent conditions 
are still substantially identical if the polypeptides they encode are substantially 
identical. This may occur, e.g., when a copy of a nucleic acid is created using the 
maximum codon degeneracy permitted by the genetic code. One indication that two 
nucleic acid sequences are substantially identical is when the polypeptide encoded by 

20 the first nucleic acid sequence is immunologically cross reactive with the polypeptide 
encoded by the second nucleic acid sequence. 

(eXii) The term "substantial identity** in the context of a peptide indicates that 
a peptide comprises a sequence with at least 70% sequence identity to a reference 
sequence, preferably 80%, more preferably 85%, most preferably at least 90% or 95% 

, 25 sequence identity to the reference sequence over a specified comparison window. 

Preferably, optimal alignment is conducted using the homology alignment algorithm 
of Needleman and Wunsch (1970) J. Mol Biol 45:443-453. An indication that two 
peptide sequences are substantially identical is that one peptide is immunologically 
reactive with antibodies raised against the second peptide. Thus, a peptide is 

30 substantially identical to a second peptide, for example, where the two peptides differ 
only by a conservative substitution. Peptides that are "substantially similar" share 
sequences as noted above except that residue positions that are not identical riiay 
differ by conservative amino acid changes. 
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The nucleotide sequences of the invention can be used to isolate 
corresponding sequences from other organisms, particularly other insects, more 
particularly other lepidopteran species. In this manner, methods such^PCR, 
hybridization, and the like can be used to identify such sequences based on their 
5 sequence homology to the sequences set forth herein. Sequences isolated based on 
their sequence identity to the entire Bt toxin receptor sequences set forth herein or to 
fragments thereof are encompassed by the present invention. Such sequences include 
sequences that are orthologs of the disclosed sequences. By "orthologs" is intended 
genes derived from a common ancestral gene and which are found in different species 

10 as a result of speciation. Genes found in different species are considered orthologs 
when their nucleotide sequences and/or their encoded protein sequences share 
substantial identity as defined elsewhere herein. Functions of orthologs are often 
highly conserved among species. 

In a PCR approach, oligonucleotide primers can be designed for use in PCR 

1 5 reactions to amplify corresponding DN A sequences from cDNA or genomic DNA 
extracted from any organism of interest. Methods for designing PCR primers and 
PCR cloning are generally known in the art and are disclosed in Sambrook et ah 
(1 989) Molecular Cloning: A Laboratory Manual (2d ed., Cold Spring Harbor 
Laboratory Press, Plainview, New York). See also Innis et aL 9 eds. (1990) PCR 

20 Protocols: A Guide to Methods and Applications (Academic Press, New York); Innis 
and Gelfand, eds. (1995) PCR Strategies (Academic Pifess, New York); and Innis and 
Gelfand, eds. (1999) PCR Methods Manual (Academic Press, New York). Known 
methods of PCR include, but are not limited to, methods using paired primers, nested 
primers, single specific primers, degenerate primers, gene-specific primers, vector- 

25 specific primers, partially-mismatched primers, and the like. 

In hybridization techniques^ all or part of a known; nucleotide sequence is used 
as a probe that selectively hybridizes to other corresponding nucleotide sequences 
present in a population of cloned genomic DNA fragments or cDNA fragments (i.e., 
genomic or cDNA libraries) from a chosen organism. The hybridization probes may 

30 be genomic DNA fragments, cDNA fragments, RNA fragments, or other 

oligonucleotides, and may be labeled with a detectable group such as 32 P, or any other 
. : , detectable marker / Thus, for example, probes for hybridization can be made by 
labeling synthetic oligonucleotides based on the Bt toxin receptor sequences of the 
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invention. Methods for preparation of probes for hybridization and for construction of 
cDNA and genomic libraries are generally known in the art and are disclosed in 
Sambrook et aL (1989) Molecular Cloning: A Laboratory Manual (2d a Hl~ Cold 
Spring Harbor Laboratory Press, Plainview, New York). 
5 For example, the entire Bt toxin receptor sequence disclosed herein, or one or 

more portions thereof, may be used as a probe capable of specifically hybridizing to 
corresponding Bt toxin receptor sequences and messenger RNAs. To achieve specific 
hybridization under a variety of conditions, such probes include sequences that are 
unique among Bt toxin receptor sequences and are preferably at least about 10 
10 nucleotides in length, and most preferably at least about 20 nucleotides in length. 

Such probes may be used to amplify corresponding Bt toxin receptor sequences from 
a chosen plant organism by PGR. This technique may be used to isolate additional 
coding sequences from a desired organism or as a diagnostic assay to determine the 
presence of coding sequences in an organism. Hybridization techniques include 
15 hybridization screening of plated DNA libraries (either plaques or colonies; see, for 
example, Sambrook et ah (1989) Molecular Cloning: A Laboratory Manual (2d ed., 
Cold Spring Harbor Laboratory Press, Plainview, New York). 

Hybridization of such sequences may be carried put under stringent 
conditions. By "stringent conditions'* or "stringent hybridization conditions" is 
20 intended conditions under which a probe will hybridize to its target sequence to a 
detectably greater degree than to other sequences (e.g., at least 2-fold over 
background). Stringent conditions are sequence-dependent and will be different in 
different circumstances. By controlling the stringency of the hybridization and/or 
washing conditions, target sequences that are 100% complementary to the probe can 
25 be identified (homologous probing). Alternatively, stringency conditions can be 

adjusted to allow some mismatching in sequences so that lower degrees of similarity 
are detected (heterologous probing). Generally, a probe is less than about 1000 
nucleotides in length, preferably less than 500 nucleotides in length. 

Typically, stringent conditions will be those in which the salt concentration is 
30 less than about IjS M Na ion, typically about 0.01 to 1.0 M Na ion concentration (or 
other salts) at pH 7.0 to 8.3 and the temperature is at least about 30°C for short probes 
(e.g., 10 to 50 nucleotides) and at least about 60°C for long probes (e.g., greater than 
50 nucleotides). Stringent conditions may also be achieved with the addition of 
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destabilizing agents such as formamide. Exemplary low stringency conditions 
include hybridization with a buffer solution of 30 to 35% formamide, 1 M Nad, 1% 
SDS (sodium dodecyl sulphate) at 37°C, and a wash in IX to 2X SSC<20X SSC = 3.0 
M NaCl/0.3 M trisodium citrate) at 50 to 55°C. Exemplary moderate stringency 
5 conditions include hybridization in 40 to 45% formamide, 1.0 M NaCl, 1% SDS at 
37°C, and a wash in 0.5X to IX SSC at 55 to 60°C. Exemplary high stringency 
conditions include hybridization in 50% formamide, 1 M NaCl, 1% SDS at 37°C, and 
a wash in 0.1X SSC at 60 to 65°C. Duration of hybridization is generally less than 
about 24 hours, usually about 4 to about 12 hours. 

1 0 Specificity is typically the function of post-hybridization washes, the critical 

factors being the ionic strength and temperature of the final wash solution. For DN A- 
DNA hybrids, the T m can be approximated from the equation of Meinkoth and Wahl 
(1984) Anal Biochem. 755:267-284: T m = 81 .5°C + 16.6 (log M) + 0.41 (%GC) - 
0.61 (% form) - 500/L; where M is the molarity of monovalent cations, %GC is the 

1 5 percentage of guanosine and cytosine nucleotides in the DN A, % form is the 

percentage of formamide in the hybridization solution, and L is the length of the 
hybrid in base pairs. The T m is the temperature (under defined ionic strength and pH) 
at which 50% of a complementary target sequence hybridizes to a perfectly matched 
probe. T m is reduced by about 1°C for each 1% of mismatching; thus, T m , 

20 hybridization, and/or wash conditions can be adjusted to hybridize to sequences of the 
desired identity. For example, if sequences with >90% identity are sought, the T m can 
be decreased 1 0°C. Generally, stringent conditions are selected to be about 5°C lower 
than the thermal melting point (T m ) for the specific sequence and its complement at a 
defined ionic strength and pH. However, severely stringent conditions can utilize a 

25 hybridization and/or wash at 1 , 2, 3, or 4°C lower than the thermal melting point (T m ); 
moderately stringent conditions can utilize a hybridization and/or wash at 6, 7, 8, 9, or 
10°C lower than the thermal melting point (T m ); low stringency conditions can utilize 
a hybridization and/or wash at 1 1, 12, 13, 14, 15, or 20°C lower than the thermal 
melting point (T m ). Using the equation^ hybridization and wash compositions, and 

30 desired T m , those of ordinary skill will understand that variations in the stringency of 
hybridization and/or wash solutions are inherently described. If the desired degree of 
mismatching results in a T m of less than 45°C (aqueous solution) or 32?G (formamide 
solution), it is preferred to increase the SSC concentration so that a higher temperature 

29 
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can be used. An extensive guide to the hybridization of nucleic acids is found in 
Tijssen (1993) Laboratory Techniques in Biochemistry and Molecular Biology- 
Hybridization with Nucleic Acid Probes, Part I, Chapter 2 (Elsevier; NewYork); and 
Ausubel et aL, eds. (1995) Current Protocols in Molecular Biology, Chapter 2 
5 (Greene Publishing and Wiley-Interscience, New York). See Sambrook et aL (1989) 
Molecular Cloning: A Laboratory Manual (2d ed., Cold Spring Harbor Laboratory 
Press, Plainview, New York). 

Thus, isolated sequences that encode for a Bt toxin receptor protein and which 
hybridize under stringent conditions to the Bt toxin receptor sequences disclosed 

10 herein, or to fragments thereof, are encompassed by the present invention. Such 

sequences will be at least about 40% to 50% homologous, about 60%, 65%, or 70% 
homologous, and even at least about 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 
95%, 96%, 97%, 98%, 99% or more homologous with ttofdisclosed sequences. That 
is, the sequence identity of sequences may range, sharing at least about 40% to 50%, 

1 5 about 60%, 65%, or 70%, and even at least about 75%, 80%, 85%, 90%, 9 1 %, 92%, 
93%, 94%, 95%, 96%, 97%, 98%, 99% or more sequence identity. 

The compositions and screening methods of the invention are useful for 
identifying cells expressing the BT toxin receptors of the invention, and variants and 
homologues thereof. Such identification could utilize detection methods at the protein 

20 level, such as ligand-receptor binding; or at the nucleotide level. Detection of the 

polypeptide could be in situ by means of in situ hybridization of tissue sections but may 
also be analyzed by bulk polypeptide purification and subsequent analysis by Western 
blot or immunological assay of a bulk preparation. Alternatively, receptor gene 
expression can be detected at the nucleic acid level by techniques well known to those of 

25 ordinary skill in any art using complimentary polynucleotides to assess the levels of 

genomic DN A, mRNA, and the like. As an example, PCR primers complimentary to the 
nucleic acid of interest can be used to identify the level of expression. Tissues and cells 
identified as expressing the receptor sequences of the invention are determined to be 
susceptible to toxins which bind the receptor polypeptides. 

30 Where the source of the cells identified to express the receptor polypeptides of 

the invention is an organism, for example an insect plant pest, the organism is 
determined to be susceptible to toxins capable of binding the polypeptides. In a 
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particular embodiment, identification is in a lepidopteran plant pesr expressing the Bt 
toxin receptor of the invention. 

The invention encompasses antibody preparations with specificity^gainst the 
polypeptides of the invention. In further embodiments of the invention, the antibodies 
5 are used to detect receptor expression in a cell. 

In one aspect, the invention is particularly drawn to compositions and 
methods for modulating susceptibility of plant pests to Bt toxins. However, it is 
recognized that the methods and compositions could be used for modulating 
susceptibility of any cell or organism to the toxins. By "modulating" is intended that 
1 0 the susceptibility of a cell or organism to the cytotoxic effects of the toxin is increased 
or decreased. By "suceptibility" is intended that the viability of a cell contacted with 
the toxin is decreased. Thus the invention encompasses expressing the cell surface 
receptor polypeptides of the invention to increase susceptibility of a target cell or 
organ to Bt toxins. Such increases in toxin susceptibility are useful for medical and 
1 5 veterinary purposes in which eradication or reduction of viability of a group of cells is 
desired. Such increases in susceptibility are also useful for agricultural applications in 
which eradication or reduction of population of particular plant pests is desired. 

Plant pests of interest include, but are not limited to insects, nematodes, and 
the like. Nematodes include parasitic nematodes such as root-knot, cyst, lesion, and 
20 renniform nematodes, etc. 

The following examples are offered by way of illustration and not by way of 
limitation. 



EXPERIMENTAL 
25 EXAMPLE 1 : Isolation of EC Bt toxin receptor 

Standard recombinant methods well known to those of ordinary skill in the art 
were carried out For library construction, total RNA was isolated from the midgut of 
European com borer (ECB), Ostrinia nubilalis. Com borer larvae (for example, a mix of 
stage 2, 3, and 4, equal weight) can be pulverized in liquid nitrogen, homogenized, and 
30 total RNA extracted by standard procedures. PolyA RNA can be isolated from the total 
RNA with standard PolyA isolation procedures; such as the PolyATact system from 
Promega Corporation, Madison, WI. cDNA synthesis can then be performed and, for 
example, unidirectional cDNA libraries can be constructed according to known and 
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commercial procedures, such as the ZAP Express cDNA synthesis kit from Stratagene, 
La jolla, CA. cDNA can be amplified by PGR, sized and properly digested with 
restriction fragments to be ligated into a vector. Subcloned cDNA can be sequenced to 
identify sequences with the proper peptide to identity corresponding to published 
5 sequences. These fragments can be used to probe genomic or cDNA libraries 

corresponding to a specific host, such as Ostrinia nubilalis, to obtain a full length coding 
sequence. Probes can also be made based on Applicants disclosed sequences. The 
coding sequence can then be ligated into a desired expression cassette and used to 
transform a host cell according to standard transformation procedures. Such an 

10 expression cassette can be part of a commercially available vector and expression 
system; for example, the pET system from Novagen Inc. (Madison, WI). Additional 
vectors that can be used for expression include pBKCMV, pBKRSV, pPbac and pMbac 
(Stratagene Inc.), pFASTBacl (Gibco BRL) and other common bacterial, baculovirus, 
mammalian, and yeast expression vectors. 

15 All vectors were constructed using standard molecular biology techniques as 

described for example in Sambrook et aL 9 (1989) Molecular Cloning: A Laboratory 
Manual (2 nd ed., Cold Spring Harbor Laboratory: Cold Spring Harbor, N.Y.). 

Expression is tested by ligand blotting and testing for Bt toxin binding. Ligand 
blotting, binding, and toxicity are tested by known methods; for example, as described in 

20 Martinez-Ramirez (1994) Biochem. Biophys. Res. Comm. 201: 782-787; Vadlamudi et 
al (1995) J Biol Chem 270(1 0;:549O-4, Keeton et al (1998) Appl Environ Microbiol 
(5^:2158-2165; Keeton et al (1997) Appl Environ Microbiol 63 (9) :3419-3425; 
Ihara et al (1998) Comparative Biochemistry dnd Physiology, Part B 120: 197-204; 
Nagamatsu et al (1998) Biosci. Biotechnol. Biochem. 62(4) J\ 8-726 and Nagamatsu 

25 et al (1998) Biosci. Biotechnol Biochem. 52^:727-734. 

Identifying the CrylA(b) binding polypeptide in ECB was done by ligand 
blotting brush border membrane vesicle polypeptides and probing those polypeptides 
for binding with CrylA(b) toxin. Two polypeptides, approximately 210 and 205 kDa, 
were found to bind to CrylA(b). Plotting and binding were done essentially as 

30 described in the preceding paragraph. 

Degenerate primers for RT-PCR were designed based on known Cry! toxin 
binding polypeptide sequences from Manducca sexta and Bombyx mori. • .The primers 
are shown below. cDNA was constructed from total midgut RNA (cDNA synthesis 
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kit GibcoBrL). Degenerate primers were used to amplify products of the expected 
size. The annealing temperature used was 53°C in generation of the 280 bp fragment 
and 55°C when generating the 1.6 kb fragment. : "~ ' 

A 280bp fragment was obtained from ECB midgut RNA. Upon cloning and 
5 sequencing, the fragment was identified as having homology with the Bt toxin 
receptor 1 polypeptide (BTR1) described in Vadlamudi et ah (1995) J Biol Chem 
270(1 0;:5490-4. 

A similar approach was used to generate a 1 .6 kilobase pair clone. The 
sequence of primers used to generate the 280 base pair fragment were: 
10 Primer BTRD1S: 5XjTTAMYGTGAGAGAGGCAGAYCC3' (SEQ ID NO:8), and 
Primer BTRD5A: 5'GGATRTTAAGMGTCAG YAC WCCG3* (SEQ ID NO:9). 
The sequence of primers used to generate the 1.6 kb fragment were: 
Primer BTRD6S: 5TCCG AATTCTTCTTY A ACCTC ATCG A Y AACTT3 1 (SEQ ID 
NO: 10), and 

1 5 Primer BTRD7A: 5 , CGCAAGCTTACTTGGTCGATGTTRCASGTCAT3 , (SEQ ID 
NO:ll) 

The 1.6 kb fragment clone was ligated in an E. coli expression vector, pET- 
28a-c(+), and expressed using the pET system (Novagen Inc., Madison, WI). Purified 
polypeptide encoded by this 1 .6kb fragment demonstrated binding to CrylA(b) in 

20 ligand blots. An ECB midgut cDN A library was generated and screened using this 

1 .6kb clone, generating 120 positive plaques. Thirty of these plaques were chosen for 
secondary screening and fifteen of those plaques were purified and sent for DNA 
sequencing. ■ f 

The obtained nucleotide sequence of the selected Bt toxin receptor clone from 

25 ECB is set forth in SEQ ID NO: 1. The total length of the clone is 5498 base pairs. 
The coding sequences are residues 162-5312. The Cryl A binding site is encoded by 
residues 4038-4547. The predicted transmembrane domain is encoded by residues 
4872-4928. The corresponding deduced amino acid sequence for this Bt toxin receptor 
clone from ECB is set forth in SEQ ID NO: 2, 

30 The purified polypeptide generated from the 1 .6kb fragment set forth in SEQ 

ID NO:7 was used to inoculate rabbits for the production of polyclonal antibodies. On 
zoo western blots prepared from brush border membrane Vesicles from various insect 
species, this set of antibodies specifically recognized ECB Bt toxin receptor 
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polypeptides, in comparison to Bt toxin receptor homologues polypeptides from other 
insect species. Rabbit polyclonal antibodies were also raised from a purified 
polypeptide corresponding to amino acids 1293-1462 of SEQ ID NG:2.~~ 

5 Example 2: Isolation of CEW and FA W Bt toxin receptor orthologues: 

cDNA encoding a full-length Bt toxin receptor from corn earworm (CEW, 
Heliothis Zed) was isolated. The nucleotide sequence for this cDNA is set forth in SEQ 
ID NO: 3. Nucleotides 1 71-5360 correspond to the open reading frame. Nucleotides 
491 7-4973 correspond to the transmembrane region. Nucleotides 4083-4589 correspond 
1 0 to the CiylA binding site. The deduced corresponding amino acid sequence for the 
CEW Bt toxin receptor is set forth in SEQ ID NO: 4. 

cDNA encoding a full-length Bt toxin receptor from fall armyworm (FA W, 
Spodopterafrugiperdd) was isolated. The nucleotide sequence for this cDNA is set forth 
in SEQ ID NO: 5. Nucleotides 162-5363 correspond to the open reading frame. 
1 5 Nucleotides 4 1 1 0-46 1 6 correspond to the Cryl A binding site. Nucleotides 494 1 -4997 
correspond to the transmembrane region. Nucleotides 1 62-227 correspond to a signal 
peptide. The deduced corresponding amino acid sequence for the FAW Bt toxin 
receptor is set forth in SEQ ID NO: 6. 

20 Example 3: Binding and cell death in lepidopteran insect cells expressing the Bt toxin 
receptors of the invention: 

An in vitro system is developed to demonstrate the functionality of a Bt toxin 
receptor of the invention. The results disclosed in this example demonstrate that the 
ECB Bt toxin receptor of the invention (SEQ ID NOs:l and 2) is specifically involved 

25 in the binding and killing action of Cry 1 Ab toxin. 

Well known molecular biological methods are used in cloning and expressing 
the ECB Bt toxin receptor in Sf9 cells. A baculovirus expression system (Gibco BRL 
Catalogue No. 10359-016) is used according to the manufacturer's provided protocols 
and as described below. S. frugiperda (Sf9) cells obtained from ATCC (ATCC-CRL 

30 1 71 1) are grown at 27°C in Sf-900 II serum free medium (Gibco BRL, Catalogue No. 
10902-088). These cells, which are not susceptible to Cryl Ab toxin, are transfected 
with an expression construct (pFastBac l bacmid, Gibco BRL catalogue NO. 10360- 
014) comprising an operably linked Bt toxin receptor of the invention (SEQ ID NO:l) 
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downstream of a polyhedriri promoter. Transfeeted SO cells express the ECB Bt toxin 
receptor and are lysed in the presence of CrylAb toxin. Toxin specificities, binding 
parameters, such as Kd values, and half maximal doses for cellular death and/or 
toxicity are also determined. 
5 For generating expression constructs, the ECB g t toxin receptor cDNA (SEQ 

ID NO:l) is subjected to appropriate restriction digestion, and the resulting cDNA 
comprising the full-length coding region is ligated into the donor plasmid pFastBacl 
multiple cloning site. Following transformation and subsequent transposition, 
recombinant bacmid DNA comprising the ECB Bt toxin receptor (RBECBl) is 

10 isolated. As a control, recombinant bacmid DNA comprising the reporter gene 
glucuronidase (RBGUS) is similarly constructed and isolated. 

For transfection, 2p£ each RBECBl or RBGUS DNA is mixed with 6 ^1 of 
CellFectin (GibcoBRL catalogue NO. 10362-010) in 100 ^1 of Sf900 medium, and 
incubated at room temperature for 30 minutes. The mixture is then diluted with 0.8 ml 

15 Sf-900 medium. Sf9 cells (10 6 /ml per 35 mm well) are washed once with Sf-900 

medium, mixed with the DNA/CellFectin mixture, added to the well, and incubated at 
room temperature for 5 hours. The medium is removed and 2 ml of Sf-900 medium 
containing penicillin and streptomycin is added to the well. 3-5 days after 
transfection, Western blotting is used to examine protein expression. 

20 For Western blotting,100 ^1 of cell lysis buffer (50 mM f ris, pH7.8, 150mM 

NaCl, 1% Nonidet P-40) is added to the well. The cells are scraped and subjected to 
16,000xg centrifugation. Pellet and supernatant are separated and subjected to 
Western blotting. An antibody preparation against ECB st toxin receptor (Example 
1) is used as first antibody. Alkaline phbsphatase-labelled anti-rabbit IgG is used as 

25 secondary antibody. Western blot results indicate that the full length ECB Bt toxin 
receptor of the invention (SEQ ID NOs: 1 and 2) is expressed in the cell membrane of 
these cells. 

For determining GUS activity, the medium of the cells transfeeted with 

RBGUS is removed. The cells and the medium are separately mixed with GUS 

30 substrate and assayed for the well known enzymatic activity. GUS activity assays 

indicate that this reporter gene is actively expressed in the transfeeted cells. 

For determining toxin susceptibility, Cry toxins including but not limited to 

CrylA, CrylB, CrylC, CrylD, CrylE, CrylF, Cryll, Cry2, Cry3, and bry9 toxins 
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(Schnepf E. et al. (1998) Microbiology and Molecular Biology Reviews 62(3): 775- 
806) are prepared by methods known in the art. Crystals are dissolved in pH 10.0, 50 
mM carbonate buffer and treated with trypsin. Active fragments of Cry proteins are 
purified by chromatography. Three to five days after transfection, cells are washed 
5 with phosphate buffered saline (PBS). Different concentrations of active fragments of 
Cry toxins are applied to the cells. At different time intervals, the cells are examined 
under the microscope to readily determine susceptibility to the toxins. Alternatively, 
cell death, viability and/or toxicity is quantified by methods well known in the art. 
See, for example, In Situ Cell Death Detection Kits available from Roche 
10 Biochemicals (Catalogue Nos. 2 156 792, 1 684 809, and 1 684 817), and 

LIVE/DEAD® Viability/Cytotoxicity Kit available from Molecular Probes (catalogue 
No. L-3224). 

A dose-dependent response of RBECB1 -transfected cells to Cryl Ab is readily 
observed, with determined Kd values well within the range for many receptors. 

15 Control cells, e.g. those transfected with pFastBacl bacmid without an insert or those 
transfected with RBGus are not significantly affected by Cryl Ab. Interaction with 
other Cry toxins are similarly characterized. 

This in vitro system is not only be used to verify the functionality of putative 
tfMoxin receptors, but also used as a tool to determine the active site(s) and other 

20 functional domains of the toxin and the receptor. Furthermore, the system is used as a 
cell-based high throughput screen. For example, methods for distinguishing live 
versus dead cells by differential dyes are known in the art. This allows for aliqiiots of 
transfected cells to be treated with various toxin samples and to serve as a means for 
screening the toxin samples for desired specificity or binding characteristics: Since the 

25 system is used to identify the specificity of Cry protein receptors, it is a useful tool in 
insect resistance management 

Example 4: Expression of the ECB Bt toxin receptor in toxin susceptible stages of the 
insect's life cycle: 

30 Total RNA was isolated from the eggs, pupae, adults, and the 1 st through the 

5th instar developmental stages, using TRIzol Reagent (Gibco BRL) essentially as 
instructed by the manufacturer.(Gibco BRL). The RNA was quantitated and 20 ug of 
each sample was loaded onto a formaldehyde agarose gel and electrophoresed at 
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constant voltage. The RNA was then transferred to a nylon membrane via neutral 
capillary transfer and cross-linked to the membrane using ultraviolet light. For 
hybridization, a 460 base pair ECB Bt toxin receptor DNA probe (bases : 3~682 to 4141 
in SEQ ID NO:l) was constructed from a 460 base pair fragment prepared according 
5 to the manufacturer's protocol for Amersham Rediprime II random prime labeling 
system. The denatured probe was added to the membrane that had been prehybridized 
for at least 3 hours at 65°C and allowed to incubate with gentle agitation for at least 
12 hours at 65°C. Following hybridization, the membranes were washed at 65°C for 
1 hour with 1/4X 0.5M NaCl, 0.1M NaP04 (ph 7.0), 6mM EDTA and 1% SDS 

10 solution followed by two 1 hour washes in the above solution without SDS. The 
membrane was air dried briefly, wrapped in Saran Wrap and exposed to X-ray film. 

An ECB Bt toxin receptor transcript of 5.5 kilobase was expressed strongly in 
the larval instars with much reduced expression in the pupal stage. The expression 
levels appeared to be fairly consistent from first to fifth instar, while decreasing 

15 markedly in the pupal stage. There were no detectable transcripts in either the egg or 
adult stages. These results indicate that the ECB Bt toxin transcript is being produced 
in the susceptible stages of the insects life cycle, while not being produced in stages 
resistant to the toxic effects of Cryl Ab. 



20 Example 5: Tissue and subcellular expression of the ECB Bt toxin receptor: 

Fifth instar ECB were dissected to isolate the following tissues: fat body (FB), 
malpighian tubules (MT), hind gut (HG), anterior midgut (AM) and posterior midgut 
(PM). Midguts from fifth instar larvae were also isolated for brush border membrane 
vesicle (BBMV) preparation using the well known protocol by Wolfersberger et 

25 a/.(1987) Comp. Biochem, Physiol. #6,4:301-308. Tissues were homogenized in Tris 
buffered saline, 0.1 % tween-20, centrifuged to pellet insoluble material, and 
transferred to a fresh tube. 50 ug of protein from each preparation was added to SDS 
sample buffer and B-mercaptoethanol, heated to 100°C for 10 minutes and loaded 
onto a 4-12% Bis-Tris gel (Novex). After electrophoresis, the proteins were 

30 transferred to a nitrocellulose membrane using a semi-dry apparatus^ The membrane 
was blocked in 5% nonfat dry milk buffer for 1 hour at room temperature with gentle 
agitation. The primary antibody (Example 1 ) was added to a final dilution of 1 :5000 
and allowed to hybridize for 1 hour. The blot was then washed three times for 20 
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minutes each in nonfat milk buffer. The blot was then hybridized with the secondary 
antibody (goat anti-rabbit with alkaline phosphatase conjugate) at a dilution of 
1 : 10000 for 1 hour at room temperature. Washes were performed as before. The 
bands were visualized by using the standard chemiluminescent protocol (Tropix 
5 western light protein detection kit). 

The ECB Bt toxin receptor protein was only visible in the BBMV enriched 
lane, and not detected in any of the other ECB tissues types. This result indicates that 
the expression of the ECB Bt toxin receptor protein is at very low levels, since the 
BBMV preparation is a 20-30 fold enriched fraction of the midgut brush border. The 

1 0 result supports propositions that the ECB Bt toxin receptor is an integral membrane 
protein uniquely associated with the brush border. It also demonstrates that the ECB 
Bt toxin receptor is expressed in the envisioned target tissue for CrylAb toxins. 
However, the result does not necessarily rule out expression in other tissue types, 
albeit the expression of this protein in those tissues may be lower than in the BBMV 

1 5 enriched fraction. 

All publications and patent applications mentioned in the specification are 
indicative of the level of those skilled in the art to which this invention pertains. All 
publications and patent applications are herein incorporated by reference to the same 
extent as if each individual publication or patent application was specifically and 

20 individually indicated to be incorporated by reference. 

Although the foregoing invention has been described in some detail by way of 
illustration and example for purposes of clarity of understanding, it will be obvious 
that certain changes and modifications may be practiced within the scope of the 
appended claims. 



WO 01/36639 PCT/US00/31674 



Applicant's or agent's 


International application No. 


. — f . — 


file reference 35718/204291 







INDICATIONS RELATING TO DEPOSITED MICROORGANISM 
OR OTHER BIOLOGICAL MATERIAL 

(PCTRule13b/s) 



A. The indications made below relate to the deposited microorganism or other biological material referred to in the description on page 3, lines 
23.26and31 . 



B. IDENTIFICATION OF DEPOSIT 


Further deposits are identified on an additional sheet □ 


Name of depository institution 

American Type Culture Collection 


Address of depositary institution (including postal code and country) 




10801 University Blvd. 
Manassas, VA 201 10-2209 US 




Date of deposit 

j 25 June 1999 (25.06.99) 


Accession Number 

PTA-278 


C. ADDITIONAL INDICATIONS (leave blank if not applicable) 


This information is continued on an additional sheet □ 



D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if the indicators are not tor all designated States) 



E. SEPARATE FURNISHING OF INDICATIONS fleave blank if not applicable) 



The indications listed below will be submitted to the International Bureau later (specify the general nature of the indications e.g., "Accession 
Number of Deposit") 



k/tw 



For receiving Office use only 

l received with the interMfaytopplication 




PCt/hiternatlj 
(703)305-3670 



Form PCT/RO/134 (July 1998) 



For International Bureau use only 



Q This sheet was received with the International Bureau on: 




39 



WO 01/36639 



PCT/US00/31674 



Applicant's or agent's 

file reference 3571 8/204291 



Internationa) application No. 



INDICATIONS RELATING TO DEPOSITED MICROORGANISM 
OR OTHER BIOLOGICAL MATERIAL 

(PCTRu!e13tts) 



A. The indications made below relate to the deposited fTricroorganism or other bk>togtcal material referred to in the description on page 3, fines 
23/26 and 31 


B. IDENTIFICATION OF DEPOSIT 


Further deposits are identified on an additional sheet fl 


Name of depository institution 

American Type Culture Collection 


Address of depositary institution (including postal code and country) 




10801 University Blvd. 
Manassas, VA 201 10-2209 US 




Date of deposit 

25 April 2000 (25.04.00) 


Accession Number 

PTA-1760 


C. ADDITIONAL INDICATIONS (leave blank if not appTtcabfe) 


This information is continued on an additional sheet □ 




D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if the indicators are not for all designated States) 




E. SEPARATE FURNISHING OF INDICATIONS (leave blank ff not applicable) 


The indications listed below will be submitted to the International Bureau 1, 
Number of Deposit*) 


ater (specify the general nature of the indications e.g., "Accession 



For receiving Office use only 




For International Bureau use only 



□ This sheet was received with the International Bureau on: 



Authorized officer 



Form PCT/RO/134 (July 1998) 



40 



WO 01/36639 




PCT/US00/31674 


I Applicants or agent's 

| file reference 35718/204291 


International application No. | 



INDICATIONS RELATING TO DEPOSITED MICROORGANISM 
OR OTHER BIOLOGICAL MATERIAL 

(PCTRule136/s) 



A. The indications made below relate to the deposited mk*oorganism or other biological material referred to in the description on page 3, Ones 
23,26and31 


B. IDENTIFICATION OF DEPOSIT 


Further deposits are identified on an additional sheet □ 


Name of depository institution 

American Type Culture Collection 


Address of depositary institution (including postal code and county) 




10801 University Blvd. 
Manassas, VA 201 10-2209 US 




Date of deposit 

11 July 2000 (11.07.00) 


Accession Number 

PTA-2222 


C. ADDITIONAL INDICATIONS (leave blank if not applicable) 


This information is continued on an additional sheet Q 



D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (rf the indicators are nof for aff designated States) 



E. SEPARATE FURNISHING OF INDICATIONS (feave blank if not applicable) 

The indications listed below will be submitted to the international Bureau later (specify the general nature of the indications e.g., 'Accession 
Number of Deposit) 




For receiving Office use only 



received with the international application 



Hstfrod 

fHJT/internari) 
(703)306-3 6 70 



For International Bureau use only 



□ This sheet was received with the International Bureau on: 




brm PCT/RO/134 (July 1998) 



41 



WO 01/36639 

THAT WHICH IS CLAIMED: 



PCT/USOO/31674 



1 . An isolated nucleic acid molecule having a nucleotide sequence 
encoding a Bt toxin receptor, said sequence selected from the group consisting of: 

a) a nucleotide sequence set forth in SEQ ID NO: 1 , SEQ ID NO: 3 
orSEQIDNO:5; 

5 b) a nucleotide sequence having at least about 60 % identity to the 

nucleotide sequence of a); 

c) a nucleotide sequence having at least about 70 % identity to the 
nucleotide sequence of a); 

d) a nucleotide sequence having at least about 75 % identity to the 
10 nucleotide sequence of a); 

e) a nucleotide sequence having at least about 85 % identity to 
the nucleotide sequence of a); 

f) a nucleotide sequence having at least about 95 % identity to the 
nucleotide sequence of a); 

15 g) a nucleotide sequence consisting of at least 22 contiguous 

nucleotides of the nucleotide sequence set forth in SEQ ID NO: 1 ; 

h) a nucleotide sequence consisting of at least about 15 contiguous 
nucleotides of the nucleotide sequence set forth in SEQ ID NO:3, or SEQ ID NO:5 ; 

i) a nucleotide sequence that hybridizes under stringent 
20 conditions to the nucleotide sequence of a); and 

2. The nucleic acid molecule of claim 1 , wherein said toxin is a Cry 1 A 

toxin. 

25 3. The nucleic acid of claim 2, wherein said Cry 1 A toxin is a Cry 1 A(b) 

toxin. 

4. An isolated polypeptide having the amino acid sequence selected from 
the group consisting of: 

a) an amino acid sequence set forth in SEQ ID NO: 2, SEQ ED NO: 
30 4, or SEQ ID NO: 6; 

b) an amino acid sequence having at least about 52% identity to 
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the amino acid sequence set forth in SEQ ID NO: 2; 

c) an amino acid sequence having at least about 60 % identity to the 
amino acid sequence of a); 

d) an amino acid sequence having at least about 70 % identity to the 
5 amino acid sequence of a); 

e) an amino acid sequence having at least about 75 % identity to 
an amino acid sequence of a); 

f) an amino acid sequence having at least about 85 % identity to an 
amino acid sequence of a); 

10 g) an amino acid sequence having at least about 95 % identity to an 

amino acid sequence of a); 

h) an amino acid comprising at least about 15 contiguous residues of 
the amino acid nucleotide sequence of a); 

i) an amino acid sequence encoded by a nucleotide sequence 
15 according to claim 1; 

j) a variant of the amino acid sequence of a); 

k) a fragment of the amino acid sequence of a); and 

1) a fragment of the amino acid sequence of a) that binds Bt toxin. 

20 5. A fusion polypeptide comprising the polypeptide of claim 4, and at 

least one polypeptide of interest 

6. The fusion polypeptide of claim 5, wherein said polypeptide of interest 
is a toxin receptor. 

25 

7. An expression cassette comprising a nucleotide sequence encoding the 
fusion polypeptide of claim 5, wherein said nucleotide sequence is operably linked to 
a promoter that drives expression in a cell of interest. 

30 8. The expression cassette of claim 7 wherein said polypeptide of interest 

is a toxin receptor. 

9. An antibody preparation specific for the polypeptide of claim 4. 

— - . . _ .43 - - - 
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1 0. An expression cassette comprising at least one nucleotide sequence 
according to claim 1, wherein said nucleotide sequence is operably linked to a 
promoter that drives expression in a cell of interest. 

5 

1 1 . The expression cassette of claim 10, wherein said cell of interest is an 
insect or mammalian cell. 

12. The expression cassette of claim 10 wherein said cell of interest is a 
10 microorganism. 

13. The expression cassette of claim 12 wherein said microorganism is 
yeast or bacteria. 

1 5 14. A vector for delivery of a nucleotide sequence to a cell of interest, the 

vector comprising at least one nucleotide sequence according to claim 1. 

15. A cell containing the vector of claim 14. 

20 1 6. A transformed cell of interest having stably incorporated within its 

genome a nucleotide sequence selected from the group consisting of: 

a) a nucleotide sequence set forth in SEQ ID NO: 1, SEQ ID NO: 3 
orSEQIDNO:5; 

b) a nucleotide sequence having at least about 60 % identity to the 
25 nucleotide sequence of a); 

c) a nucleotide sequence having at least about 70 % identity to the 
nucleotide sequence of a); 

d) a nucleotide sequence having at least about 75 % identity to the 
nucleotide sequence of a); 

30 e) a nucleotide sequence having at least about 85 % identity to 

the nucleotide sequence of a); 

f) a nucleotide sequence having at least about 95 % identity to the 
nucleotide sequence of a); 
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g) a nucleotide sequence consisting of at least 22 contiguous 
nucleotides of the nucleotide sequence set forth in SEQ ID NO:l ; 

h) a nucleotide sequence consisting of at least about 1 5 contiguous 
5 nucleotides of the nucleotide sequence set forth in SEQ ID NO:3, or SEQ ID NO:5 ; 

i) a nucleotide sequence that hybridizes under stringent conditions to 
the nucleotide sequence of a); and 

17. The transformed cell of claim 16 , wherein said cell is a plant cell. 

10 

1 8. The transformed cell of claim 1 7, wherein said plant cell is 
monocotyledonous. 

19. A method for screening for ligands that bind Bt toxin receptor, said 
1 5 method comprising: 

i) providing at least one Bt toxin receptor polypeptide according 

to claim 4; 

ii) contacting said polypeptide with a sample and a control ligand 
under conditions promoting binding; and 

20 iii) determining binding characteristics of said sample ligand, 

relative to said control ligand. 

20. A method for screening for ligands that bind Bt toxin receptor, said 
method comprising: 

25 i) providing at least one Bt toxin receptor polypeptide having the 

amino acid sequence selected from the group consisting of a, b, c, d, e, f, g, h , i, and j 
of claim 4 in cells expressing said polypeptide wherein said polypyptide comprises a 
toxin binding domain ; 

ii) contacting said cells with a sample and a control ligand under 
30 conditions promoting binding; and 

iii) determining binding characteristics of said sample ligand, 
relative to said control ligand. 

21. The method of claim 20 wherein said toxin is a Cry 1 A toxin. 
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22. A method for screening for toxins that bind Bt toxin receptor, said 
method comprising the steps of claim 20; further comprising determining viability of 
said cells contacted with a sample ligand relative to said cells contacted with a control 

5 ligand. 

23. The method of claim 20, wherein said sample ligand is a chimeric 
polypeptide comprising at least one primary polypeptide that binds a polypeptide 
having the amino acid sequence selected from the group consisting of: 

10 a) an amino acid sequence set forth in SEQ ID NO: 2, SEQ ID NO: 4, 

or SEQ ID NO: 6; 

b) an amino acid sequence having at least about 52% identity to the 
amino acid sequence set forth in SEQ ID NO: 2; 

c) an amino acid sequence having at least about 60 % identity to the 
15 amino acid sequence of a); 

d) an amino acid sequence having at least about 70 % identity to the 
amino acid sequence of a); 

e) an amino acid sequence having at least about 75 % identity to an 
amino acid sequence of a); 

20 £) an amino acid sequence having at least about 85 % identity to an 

amino acid sequence of a); 

g) an amino acid sequence having at least about 95 % identity to an 
amino acid sequence of a); 

h) an amino acid comprising at least about 1 5 contiguous residues of 
25 the amino acid nucleotide sequence of a); 

i) an amino acid sequence encoded by a nucleotide sequence having at 
least about 60 % identity to the nucleotide sequence set forth in SEQ ID NO: 1, SEQ 
ID NO: 3 or SEQ ID NO: 5; and 

j) a variant of the amino acid sequence of a). 



30 



24. The method of claims 21, wherein said sample ligand is a chimeric 
polypeptide comprising at least one primary polypeptide that binds a polypeptide 
having the amino acid sequence selected from the group consisting of: 

46 



WO 01/36639 



PCT/US00/31674 .. 



a) an amino acid sequence set forth in SEQ ID NO: 2, SEQ ID NO: 4, 
or SEQ ID NO: 6; _ 

b) an amino acid sequence having at least about 52% identity to the 
amino acid sequence set forth in SEQ ID NO: 2; 

5 c) an amino acid sequence having at least about 60 % identity to the 

amino acid sequence of a); 

d) an amino acid sequence having at least about 70 % identity to the 
amino acid sequence of a); 

e) an amino acid sequence having at least about 75 % identity to an 
10 amino acid sequence of a); 

f) an amino acid sequence having at least about 85 % identity to an 
amino acid sequence of a); 

g) an amino acid sequence having at least about 95 % identity to an 
amino acid sequence of a); 

15 h) an amino acid comprising at least about 1 5 contiguous residues of 

the amino acid nucleotide sequence of a); 

i) an amino acid sequence encoded by a nucleotide sequence having at 

least about 60 % identity to the nucleotide sequence set forth in SEQ ID NO: 1, SEQ 

ID NO: 3 or SEQ ID NO: 5; and 
20 j) a variant of the amino acid sequence of a). 

25. The method of claims 22, wherein said sample ligand is a chimeric 
polypeptide comprising at least one primary polypeptide that binds a polypeptide 
25 having the amino acid sequence selected from the group consisting of: 

a) an amino acid sequence set forth in SEQ ID NO: 2, SEQ ID NO: 4, 
or SEQ ID NO: 6; 

b) an amino acid sequence having at least about 52% identity to the 
amino acid sequence set forth in SEQ ID NO: 2; 

30 c) an amino acid sequence having at least about 60 % identity to the 

amino acid sequence of a); 

d) an amino acid sequence having at least about 70 % identity to the 
amino acid sequence of a); 
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e) an amino acid sequence having at least about 75 % identity to an 
amino acid sequence of a); 

f) an amino acid sequence having at least about 85 % identity to an 
amino acid sequence of a); 

5 g) an amino acid sequence having at least about 95 % identity to an 

amino acid sequence of a); 

h) an amino acid comprising at least about 1 5 contiguous residues of 
the amino acid nucleotide sequence of a); 

i) an amino acid sequence encoded by a nucleotide sequence having at 
10 least about 60 % identity to the nucleotide sequence set forth in SEQ ID NO: 1, SEQ 

ID NO: 3 or SEQ ID NO: 5; and 

j) a variant of the amino acid sequence of a). 
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SEQUENCE LISTING 

<110> Flannagan, Ronald D. 
Mathis, John P. 
Meyer , Terry E. 



<120> Novel Bt Toxin Receptors From 

Lepidopteran Insects and Methods of Use 



<130> 35718/204291 

<150> 60/166,285 

<151> 1999-11-18 

<160> 11 ~ 

<170> FastSEQ for Windows Version 4.0 

<210> 1 
<211> 5498 
<212> DNA 

<213> Ostrinia nubilalis 

<220> 
<221> CDS 

<222> (162) ... (5312) 
<400> 1 

cataataaca ataaagagga agtgtgtgtg aaaaacgaag aagttaataa acctggataa 60 
ttaaacctga aaaaaaccgg tgtttaagtg gaatttttgc tgaaggacaa ccgtgggata 120 
gctcaaatat taaaattcta cataactaag gatcatgcaa a atg ggg gtt gag agg 176 

Met Gly Val Glu Arg 
1 5 

ttc ttc cca gca gtg eta ctg gtc tct tta gec tct gee gca eta gec 224 
Phe Phe Pro Ala Val Leu Leu Val Ser Leu Ala Ser Ala Ala Leu Ala 
10 15 20 

aac caa cga tgt teg tac att ate gca ata cca aga ccg gag act ccg 272 
Asn Gin Arg Cys Ser Tyr He He Ala lie Pro Arg Pro Glu Thr Pro 
25 30 35 

gaa ctg ccg cct att gat tac gaa gga aaa tea tgg agt gaa cag cct 320 
Glu Leu Pro Pro He Asp Tyr Glu Gly Lys Ser Trp Ser Glu Gin Pro 
40 45 50 

eta ata ccc ggc ccg ace cga gag gaa gta tgt atg gag aac ttc tta 368 
Leu He Pro Gly Pro Thr Arg Glu Glu Val Cys Met Glu Asn Phe Leu 
55 60 65 

ccg gat caa atg att cag gtc ata tac atg gag gaa gaa ate gaa gga 416 
Pro Asp Gin Met He Gin Val He Tyr Met Glu Glu Glu He Glu Gly 
70 75 80 85 

gac gtc ate att gcg aag ctt aac tat caa ggg tec aac acg ccg gtg 4 64 

Asp Val He He Ala Lys Leu Asn Tyr Gin Gly Ser Asn Thr Pro Val 

90 95 v . . t 100 

ctg teg att atg tea ggc cag ccc aga gee cag ctg ggc cct gag ttt 512 

1 
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Leu Ser lie Met Ser Gly Gin Pro Arg Ala Gin Leu Gly Pro Glu Phe 
105 HO 115 

cga cag aat gaa gca gac ggc caa tgg age ctt gtt att acg caa aga 
Arg Gin Asn Glu Ala Asp Gly Gin Trp Ser Leu Val lie Thr Gin Arg 
120 125 130 

caa gac tac gag aca gca acc atg cag age tat gtg ttc tea ate caa 
Gin Asp Tyr Glu Thr Ala Thr Met Gin Ser Tyr Val Phe Ser lie Gin 
135 140 145 

gtg gag ggt gaa tea cag gee gta ctg gtg gcg ctg gag ata gtc aac 
Val Glu Gly Glu Ser Gin Ala Val Leu Val Ala Leu Glu lie Val Asn 
150 155 160 165 

ate gac gac aat ccg ccc ate ctg caa gtg gtc age gee tgc gta att 
lie Asp Asp Asn Pro Pro He Leu Gin Val Val Ser Ala Cys Val He 
170 175 180 



560 



608 



656 



704 



cca gaa cat ggc gag get aga ctg acc gac tgc gtg tac caa gtg 
Pro Glu His Gly Glu Ala Arg Leu Thr Asp Cys Val Tyr Gin Val Ser 
185 190 1$5" 

gac cgc gac ggt gaa ate age acc cgc ttc atg acg ttc cgt gtc gac 
Asp Arg Asp Gly Glu He Ser Thr Arg Phe Met Thr Phe Arg Val Asp 
200 205 210 

age age agg get gca gat gaa age ate ttc tac atg gtt gga gaa tac 
Ser Ser Arg Ala Ala Asp Glu Ser He Phe Tyr Met Val Gly Glu Tyr 
215 220 225 

gac ccc age gac tgg ttc aat atg aag atg act gtg ggg ate aat teg 
Asp Pro Ser Asp Trp Phe Asn Met Lys Met Thr Val Gly He Asn Ser 
230 235 240 245 



tea 752 



800 



848 



896 



ccc ttg aac ttc gag aca act cag ctt cat ata ttt age gtc aca get 944 
Pro Leu Asn Phe Glu Thr Thr Gin Leu His He Phe Ser Val Thr Ala 
250 255 260 



tct gac teg eta ccg aac aac cac acg gtc acc atg atg gtg caa gtg 
Ser Asp Ser Leu Pro Asn Asn His Thr Val Thr Met Met Val Gin Val 
265 270 275 



992 



gag aac gta gag tct egg ccc cct cgc tgg gtg gag ate ttc tea gtg 1040 
Glu Asn Val Glu Ser Arg Pro Pro Arg Trp Val Glu He Phe Ser Val 
280 285 290 

cag cag ttt gac gag aag act aat cag age ttc tec etc cgc gcg ata 1088 
Gin Gin Phe Asp Glu Lys Thr Asn Gin Ser Phe Ser Leu Arg Ala He 
295 300 305 

gac ggg gac acg gga ate aat agg gee ate aac tat acc etc ate agg 1136 
Asp Gly Asp Thr Gly He Asn Arg Ala He Asn Tyr Thr Leu He Arg 
310 315 320 325 



gat gac get gac gac ttc ttt tec ctg gag gtg att gaa gac gga get 
Asp Asp Ala Asp Asp Phe Phe Ser Leu Glu Val He Glu Asp Gly Ala 
330 335 340 

att ctg cac gtg act gag ate gac cgc gac aag ctt gaa aga gag ctt 
He Leu His Val Thr Glu He Asp Arg Asp Lys Leu Glu Arg Glu Leu 
345 350 355 

2 
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ttc aac etc acc ate gtt get tac aaa tct act gac get age ttt gca - - lrZ80 
Phe Asn Leu Thr lie Val Ala Tyr Lys Ser Thr Asp Ala Ser Phe Ala 
360 365 370 

aca gag gee cac att ttc ate ate gtc aac gac gtc aat gat cag cga 1328 
Thr Glu Ala His lie Phe lie lie Val Asn Asp Val Asn Asp Gin Arg 
375 380 385 

ccc gag ccg ctg cat aaa gaa tac agt att gat ate atg gag gaa act 1376 
Pro Glu Pro Leu His Lys Glu Tyr Ser lie Asp lie Met Glu Glu Thr 
390 395 400 405 

cca atg act eta aac ttc aat gaa gaa ttt gga ttc cat gat cga gat 1424 
Pro Met Thr Leu Asn Phe Asn Glu Glu Phe Gly Phe His Asp Arg Asp 
410 415 420 

ttg ggt gaa aac get caa tac aca gtg gaa ctt gag gac gtg ttc ccg 1472 
Leu Gly Glu Asn Ala Gin Tyr Thr Val Glu Leu Glu Asp Val Phe Pro 
425 430 435 

cca ggg gcg gcg tec gca ttc tac ate gcg ccg ggg age ggc tac cag 1520 
Pro Gly Ala Ala Ser Ala Phe Tyr lie Ala Pro Gly Ser Gly Tyr Gin 
440 445 450 

agg cag acc ttc ate atg ggc acc ata aac cac acc atg ctg gat tac 1568 
Arg Gin Thr Phe lie Met Gly Thr lie Asn His Thr Met Leu Asp Tyr 
455 460 465 

gaa gat gtc att ttt cag aac ate ate att aag gtc aaa gca gtg gac 1616 
Glu Asp Val lie Phe Gin Asn lie lie lie Lys Val Lys Ala Val Asp 
470 475 480 485 

atg aac aac get age cac gtg ggc gag gcg ctg gtg tac gtg aac ctg . 1664 
Met Asn Asn Ala Ser His Val Gly Glu Ala Leu Val Tyr Val Asn Leu 
490 495 ** 500 

ate aac tgg aac gac gaa ctt ccc ate ttc gag gag age age tac tec 1712 
lie Asn Trp Asn Asp Glu Leu Pro lie Phe Glu Glu Ser Ser Tyr Ser 
505 510. 515 

gcg teg ttt aag gag acc gtc ggc gee ggc ttc ccg gtg gee acg gtg 1760 
Ala Ser Phe Lys Glu Thr Val Gly Ala Gly Phe Pro Val Ala Thr Val 
520 525 530 

etc gee etc gac aga gac ate gac gac gta gta gtg cat tea ttg atg 1808 
Leu Ala Leu Asp Arg Asp lie Asp Asp Val Val Val His Ser Leu Met 
535 540 545 

ggc aac get gtt gac tac ctg ttc ata gat gaa tea acg gga gag ate 1856 
Gly Asn Ala Val Asp Tyr Leu Phe lie Asp Glu Ser Thr Gly Glu lie 
550 555 560 565 

ttc gtg age atg gac gat gec ttc gac tac cac cga cag aac act eta 1904 
Phe Val Ser Met Asp Asp Ala Phe Asp Tyr His Arg Gin Asn Thr Leu 

570 -v- 575 580 

ttt gtt cag gtg cgc get gac gat act ttg ggc gac ggc cca cac aac 1952 
Phe Val Gin Val Arg Ala Asp Asp Thr Leu Gly Asp Gly Pro His Asn 
585 590 595 

aca gtg acc acc cag ctg gtg ata gaa ctg gag gat gtc aac aac act 2000 
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Thr Val Thr Thr Gin Leu Val lie Glu Leu Glu Asp Val Asn Asn Thr 
600 605 610 

cct ccc acc eta cgc ttg ccc cgt teg act cca age gtc gag gag aac 

Pro Pro Thr Leu Arg Leu Pro Arg Ser Thr Pro Ser Val Glu Glu Asn 

615 " 620 625 

gtt ccc gaa gga tac gag ata tec egg gaa ate act get acc gac ccg 

Val Pro Glu Gly Tyr Glu He Ser Arg Glu He Thr Ala Thr Asp Pro 

630 635 640 645 



ata gtt ate gaa acg ata tac ccc acc gag ggc aac egg ggt tec gee 
He Val He Glu Thr He Tyr Pro Thr Glu Gly Asn Arg Gly Ser Ala 
680 685 690 * " 



acg cag acg atg cgc gtg cgc gag ctg gcg gee age ggc acg etc ate 
Thr Gin Thr Met Arg Val Arg Glu Leu Ala Ala Ser Gly Thr Leu He 
760 765 770 



age gac gag tgc tec ctg gaa aac cgc acg gaa tgt cct cca gat tec 
Ser Asp Glu Cys Ser Leu Glu Asn Arg Thr Glu Cys Pro Pro Asp Ser 
840 ' 845 850 

4 



2048 



2096 



gac acc age gee tac ctg tgg ttc gag ate gac tgg gac tec acc tgg 214 4 

Asp Thr Ser Ala Tyr Leu Trp Phe Glu He Asp Trp Asp Ser Thr Trp 

650 655 660 

gee acc aag cag ggc aga gag acc aac cct act gaa tac gtc ggg tgt 2192 

Ala Thr Lys Gin Gly Arg Glu Thr Asn Pro Thr Glu Tyr Val Gly Cys 
665 670 675 



2240 



ate ggg cgc etc gtg gtg caa gag ate egg gac aac gtc acc ate gac 2288 

He Gly Arg Leu Val Val Gin Glu He Arg Asp Asn Val Thr He Asp 

695 700 705 

ttc gag gaa ttc gag atg ctt tac etc acc gtc cgc gtg agg gac etc 2336 

Phe Glu Glu Phe Glu Met Leu Tyr Leu Thr Val Arg Val Arg Asp Leu 
710 715 720 725 

aac act gtc ate gga gat gac tac gat gag gcg acg ttc acg ate aca 2384 

Asn Thr Val He Gly Asp Asp Tyr Asp Glu Ala Thr Phe Thr He Thr 
730 735 740 

ata ate gac atg aac gac aac gcg ccg ate ttc gcg aac ggc acg ctg 2432 
He He Asp Met Asn Asp Asn Ala Pro He Phe Ala Asn Gly Thr Leu 
745 750 755 



2480 



ggc tec gtg etc gee acc gac ate gac ggc ccg etc tac aac caa gtg 2528 
Gly Ser Val Leu Ala Thr Asp He Asp Gly Pro Leu Tyr Asn Gin Val 
775 780 785 _ 

cgc tac act ata caa cct aga aac aac act ccc gag gga tta gtg aag 2576 
Arg Tyr Thr He Gin Pro Arg Asn Asn Thr Pro Glu Gly Leu Val Lys 
790 795 800 805 

att gac ttc aca act ggt caa att gag gtg gat gcg aac gag gcg ate 
He Asp Phe Thr Thr Gly Gin He Glu Val Asp Ala Asn Glu Ala He 
810 815 820 

gat gca gac gaa ccc tgg cgc ttc tac ttg tac tac acc gtc ate get 2672 
Asp Ala Asp Glu Pro Trp Arg Phe Tyr Leu Tyr Tyr Thr Val He Ala 
825 830 835 



2624 



2720 



Wtrv ^wr> 
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aac tac ttc gaa gtt cca ggc gat ate gaa ata gaa ate ate gac aca - - ^768 
Asn Tyr Phe Glu Val Pro Gly Asp lie Glu He Glu He He Asp Thr 
855 860 865 

aac aac aaa gtg cct gag ccg etc act gag aag ttc aac acg acg gtg 2816 
Asn Asn Lys Val Pro Glu Pro Leu Thr Glu Lys Phe Asn Thr Thr Val 
870 875 880 885 

tac gtc tgg gag aat gec acg age ggc gac gag gtg gtc cag ctg tac 2864 
Tyr Val Trp Glu Asn Ala Thr Ser Gly Asp Glu Val Val Gin Leu Tyr 
890 895 900 

tec cac gac cgt gac aga gac gag ttg tac cac acg gta cga tac acg 2912 
Ser His Asp Arg Asp Arg Asp Glu Leu Tyr His Thr Val Arg Tyr Thr 
905 910 915 

atg aac ttt gcg gtg aac ccc cga ctg egg gat ttc ttc gag gtg gac "2960 
Met Asn Phe Ala Val Asn Pro Arg Leu Arg Asp Phe Phe Glu Val Asp 
920 925 930 

ctg gac act ggt cgc ctt gag gtg cat tac ccg ggg gac gaa v aaa ttg 3008 
Leu Asp Thr Gly Arg Leu Glu Val His Tyr Pro Gly Asp Glu Lys Leu 
935 940 945 

gac cgc gat ggg gat gag cct aca cat act ate ttt gta aat ttc ate 3056 
Asp Arg Asp Gly Asp Glu Pro Thr His Thr He Phe Val Asn Phe He 
950 955 960 965 

gat aac ttc ttt tct gat ggt gac ggt agg aga aac cag gac gaa gtt 3104 
Asp Asn Phe Phe Ser Asp Gly Asp Gly Arg Arg Asn Gin Asp Glu Val 
970 975 980 

gaa ata ttt gtc gtt eta ttg gat gtg aac gac aac get cct gag atg 3152 
Glu He Phe Val Val Leu Leu Asp Val Asn Asp Asn Ala Pro Glu Met 
985 990 995 

cca ttg cct gat gaa etc egg ttt gat gtt tec gaa gga gca gtt get 3200 
Pro Leu Pro Asp Glu Leu Arg Phe Asp Val Ser Glu Gly Ala Val Ala 
1000 1005 1010 

ggt gtc cgt gta etc cca gaa ate tac gca ccg gac agg gat gaa cca 3248 
Gly Val Arg Val Leu Pro Glu He Tyr Ala Pro Asp Arg Asp Glu Pro 
1015 1020 1025 

gac acg gac aac teg cgt gtc ggt tac gga ate ctg gac etc acg ate 3296 
Asp Thr Asp Asn Ser Arg Val Gly Tyr Gly He Leu Asp Leu Thr He 
1030 1035 1040 * 1045 

ace gac cga gac ate gag gtg ccg gat etc ttc ace atg ate teg att 3344 
Thr Asp Arg Asp lie Glu Val Pro Asp Leu Phe Thr Met He Ser He 
1050 1055 1060 

gaa aac aaa act ggg gaa ctt gag ace get atg gac ttg agg ggg tat 3392 
Glu Asn Lys Thr Gly Glu Leu Glu Thr Ala Met Asp Leu Arg Gly Tyr 
1065 1070 1075 

^99 ggc act tac gaa ata ttc att gag gee ttc gac cac ggc tac ccg 3440 
Trp Gly Thr Tyr Glu He Phe He Glu Ala Phe Asp His Gly Tyr Pro 
1080 1085 1090 



cag cag agg tec aac gag acg tac ace ctg gtc ate cgc ccc tac aac 3488 
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Gin Gin Arg Ser Asn Glu Thr Tyr Thr Leu Val lie Arg Pro Tyr Asn 

1095 1100 1105 - ~ — ' 

ttc cac cac cct gtg ttc gtg ttc ccg caa ccc gac tec gtc att egg 3536 
Phe His His Pro Val Phe Val Phe Pro Gin Pro Asp Ser Val He Arg 
1110 1115 1120 1125 

ctt tct agg gag cgc gca aca gaa ggc ggc gtt ctg gcg acg get gee 3584 
Leu Ser Arg Glu Arg Ala Thr Glu Gly Gly Val Leu Ala Thr Ala Ala 
1130 1135 1140 

aac gag ttc ctg gag ccg ate tac gee acc gac gag gac ggc etc cac 3632 
Asn Glu Phe Leu Glu Pro lie Tyr Ala Thr Asp Glu Asp Gly Leu His 
1145 1150 1155 

gcg ggc age gtc acg ttc cac gtc cag gga aat gag gag gee gtt cag 3680 
Ala Gly Ser Val Thr Phe His Val Gin Gly Asn Glu Glu Ala Val Gin 
1160 1165 1170 

tac ttt gat ata act gaa gtg gga gca gga gaa aat age ggg cag ctt 3728 
Tyr Phe Asp lie Thr Glu Val Gly Ala Gly Glu Asn Ser Gly Gin Leu 
1175 1180 1185 

ata tta cgc cag ctt ttc cca gag caa ate aga caa ttc agg ate acg 3776 
He Leu Arg Gin Leu Phe Pro Glu Gin He Arg Gin Phe Arg He Thr 
1190 1195 1200 1205 

ate egg gec acg gac ggc ggc acg gag ccc ggc ccg ctt tgg acc gac 3824 
lie Arg Ala Thr Asp Gly Gly Thr Glu Pro Gly Pro Leu Trp Thr Asp 
1210 1215 1220 

gtc acg ttt teg gtg gtc ttc gta ccc aca cag ggc gac cca gtg ttc 3872 
Val Thr Phe Ser Val Val Phe Val Pro Thr Gin Gly Asp Pro Val Phe 
1225 1230 1235 

age gaa aat gca get act gtc gee ttc ttc gag ggt gaa gaa ggc etc 3920 
Ser Glu Asn Ala Ala Thr Val Ala Phe Phe Glu Gly Glu Glu Gly Leu 
1240 1245 1250 

cgt gag agt ttt gag ctg ccg caa gca gaa gac ctt aaa aac cac etc 3968 
Arg Glu Ser Phe Glu Leu Pro Gin Aia Glu Asp Leu Lys Asn His Leu 
1255 1260 1265 

tgc gaa gat gac tgc caa gat ate tac tac agg ttt att gac ggc aac 4016 
Cys Glu Asp Asp Cys Gin Asp He Tyr Tyr Arg Phe He Asp Gly Asn 
1270 1275 1280 1285 

aac gag ggt ctt ttc gta ctg gac cag tea age aac gtc ate tec ctt 4064 
Asn Glu Gly Leu Phe Val Leu Asp Gin Ser Ser Asn Val He Ser Leu 
1290 1295 1300 

gcg cag gag ttg gac cgc gag gtg gee acg tct tac acg ctg cac ate 4112 
Ala Gin Glu Leu Asp Arg Glu Val Ala Thr Ser Tyr Thr Leu His He 
1305 1310 1315 

gcg gcg age aac teg ccc gac gec act ggg ate cct ctg cag act tec 4160 
Ala Ala Ser Asn Ser Pro Asp Ala Thr Gly He Pro Leu Gin Thr Ser 
1320 1325 1330 

ate etc gtt gtc acg gtc aat gta aga gaa gcg aac ccg cgc cca att 4208 
He Leu Val Val Thr Val Asn Val Arg Glu Ala Asn Pro Arg Pro He 
1335 1340 1345 
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ttc gag cag gac ctt tac aca gcg ggc att teg acg ttg gac age att 
Phe Glu Gin Asp Leu Tyr Thr Ala Gly lie Ser Thr Leu Asp Ser lie 
1350 1355 1360 1365 



- A2rS6 



ggc egg gaa ttg ctt act gtc agg gcg age cac aca gaa gac gac ace 
Gly Arg Glu Leu Leu Thr Val Arg Ala Ser His Thr Glu Asp Asp Thr 
1370 1375 1380 



4304 



ate acg tac acc ata gac cgt gcg age atg cag ctg gac age age eta 
lie Thr Tyr Thr lie Asp Arg Ala Ser Met Gin Leu Asp Ser Ser Leu 
1385 1390 1395 



4352 



gaa gee gtg cgc gac teg gee ttc gcg ctg cat gcg acc acc ggc gtg 
Glu Ala Val Arg Asp Ser Ala Phe Ala Leu His Ala Thr Thr Gly Val 
1400 1405 1410 



4400 



ctt teg etc aat atg cag ccc acc get tec atg cac ggc atg ttc gag 
Leu Ser Leu Asn Met Gin Pro Thr Ala Ser Met His Gly Met Phe Glu 
1415 1420 1425 



^1448 



ttc gac gtc ate get acg gat aca get tct gca ate gac aca ^cc cgt 
Phe Asp Val He Ala Thr Asp Thr Ala Ser Ala He Asp Thr Ala Arg 
1430 1435 1440 1445 



4496 



gtg aaa gtc tac etc ate tea teg caa aac cgc gtg acc ttc att ttc 
Val Lys Val Tyr Leu He Ser Ser Gin Asn Arg Val Thr Phe He Phe 
1450 1455 1460 



4544 



gat aac caa ctt gag acc gtt gag cag aac aga aat ttc ata gcg gec 
Asp Asn Gin Leu Glu Thr Val Glu Gin Asn Arg Asn Phe He Ala Ala 
1465 1470 1475 



4592 



acg ttc age acc ggg ttc aac atg acg tgc aac ate gac cag gtg gtg 
Thr Phe Ser Thr Gly Phe Asn Met Thr Cys Asn He Asp Gin Val Val 
1480 1485 1490 



4640 



ccg ttc age gac age age ggc gtg gcg caa gac gac acc acc gag gtg 
Pro Phe Ser Asp Ser Ser Gly Val Ala Gin Asp Asp Thr Thr Glu Val 
1495 1500 1505 



4688 



cgc gcg cac ttc ate egg gac aac gtg ccc gtg cag gca caa gag gtc 
Arg Ala His Phe He Arg Asp Asn Val Pro Val Gin Ala Gin Glu Val 
1510 1515 1520 1525 



4736 



gag gee gtc cgc age gac acg gtg ctg ctg cgc acc ate cag ctg atg 
Glu Ala Val Arg Ser Asp Thr Val Leu Leu Arg Thr He Gin Leu Met 
1530 1535 1540 



4784 



ctg age acc aac age ctg gtg ctg caa gac ctg gtg acg ggt gac act 
Leu Ser Thr Asn Ser Leu Val Leu Gin Asp Leu Val Thr Gly Asp Thr 
1545 1550 1555 



4832 



ccg acg eta ggc gag gag tea atg cag ate gee gtc tac gca eta gee 
Pro Thr Leu Gly Glu Glu Ser Met Gin He Ala Val Tyr Ala Leu Ala 
1560 1565 1570 



4880 



gcg etc tec get gtg eta ggc ttc etc tgc etc gta ctg ctt etc gca 
Ala Leu Ser Ala Val Leu Gly Phe Leu Cys Leu Val Leu Leu Leu Ala 
1575 1580 1585 



4928 



ttg ttc tgt agg aca aga gca ctg aac egg cag ctg caa gca etc tec 
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Leu Phe Cys Arg Thr Arg Ala Leu Asn Arg Gin Leu Gin Ala Leu Ser 
1590 1595 1600 160S - 

atg acg aag tac ggc teg gtg gac tec ggg ctg aac cgc gec ggg ctg 5024 
Met Thr Lys Tyr Gly Ser Val Asp Ser Gly Leu Asn Arg Ala Gly Leu 
1610 1615 1620 

gcg ccg ggc acc aac aag cac gec gtc gag ggc tec aac ccc atg tgg 5072 
Ala Pro Gly Thr Asn Lys His Ala Val Glu Gly Ser Asn Pro Met Trp 
1625 1630 1635 

aac gag gec ate cgc gcg ccc gac ttc gac gec ate agt gac gcg agt 5120 
Asn Glu Ala lie Arg Ala Pro Asp Phe Asp Ala lie Ser Asp Ala Ser 
1640 1645 1650 

ggc gac tec gac ctg ate ggc ate gag gac atg ccg caa ttc cgc gac 5168 
Gly Asp Ser Asp Leu lie Gly lie Glu Asp Met Pro Gin Phe Arg Asp 
1655 1660 1665 

gac tac ttc ccg ccc ggc gac aca gac tea age age ggc ate gtc ttg 5216 
Asp Tyr Phe Pro Pro Gly Asp Thr Asp Ser Ser Ser Gly He Val Leu 
1670 1675 1680 ^ 1685 

cac atg ggc gaa gec acg gac aac aag ccc gtg acc acg cat ggc aac 5264 
His Met Gly Glu Ala Thr Asp Asn Lys Pro Val Thr Thr His Gly Asn 
1690 1695 1700 

aac ttc ggg ttc aag tec acc ccg tac ctg cca cag ccg cac cca aag 5312 
Asn Phe Gly Phe Lys Ser Thr Pro Tyr Leu Pro Gin Pro His Pro Lys 
1705 1710 1715 

taactgccag ggtataacct gtccagggtg cctacgccgc gcgaagtgcg cacacgcgtt 5372 

tatcateggg aaacattagc atgaagatac ctatgtacat attgtaaatt gtaacatatc 54 32 

tatttttata caaatatatt ttatttatat ttgctaaaaa aaaaaaaaaa aaaaaaaaaa 54 92 

ctcgag 54 98 



<210> 2 
<211> 1717 
<212> PRT 

<213> Ostrinia nubilalis 



<400> 2 



Met 


Gly 


Val 


Glu Arg Phe 


Phe 


Pro 


Ala 


Val 


Leu Leu 


Val 


Ser Leu 


Ala 


1 






5 








10 






15 




Ser 


Ala 


Ala 


Leu Ala Asn 


Gin 


Arg 


Cys 


Ser 


Tyr He 


He 


Ala He 


Pro 








20 






25 








30 




Arg 


Pro 


Glu 


Thr Pro Glu 


Leu 


Pro 


Pro 


He .Asp Tyr 


Glu 


Gly Lys 


Ser 






35 






40 








45 






Trp 


Ser 


Glu 


Gin Pro Leu 


He 


Pro 


Gly 


Pro 


Thr Arg 


Glu 


Glu Val 


Cys 




50 






55 








60 








Met 


Glu 


Asn 


Phe Leu Pro 


Asp 


Gin 


Met 


lie 


Gin Val 


He 


Tyr Met 


Glu 


65 






70 










75 






80 


Glu 


Glu 


He 


Glu Gly Asp 


Val 


He 


He 


Ala 


Lys Leu 


Asn 


Tyr Gin 


Gly 








85 








90 






95 




Ser 


Asn 


Thr 


Pro Val Leu 


Ser 


He 


Met 


Ser 


Gly Gin 


Pro Arg Ala 


Gin 








100 






105 








110 




Leu 


Gly 


Pro 


Glu Phe Arg 


Gin 


Asn 


Glu 


Ala 


Asp Gly Gin Trp Ser 


Leu 






115 






120 








125 






Val 


He 


Thr 


Gin Arg Gin 


Asp 


Tyr 


Glu 


Thr 


Ala Thr 


Met 


Gin Ser 


Tyr 




130 




135 






140 








Val 


Phe 


Ser 


He Gin Val 


Glu 


Gly 


Glu 


Ser 


Gin Ala 


Val 


Leu Val 


Ala 



145 150 155 160 
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Leu Glu lie Val Asn lie Asp Asp Asn Pro Pro lie Leu Gin Val Val 

165 170 175 

Ser Ala Cys Val lie Pro Glu His Gly Glu Ala Arg Leu Thr Asp Cys 

160 185 190 

Val Tyr Gin Val Ser Asp Arg Asp Gly Glu lie Ser Thr Arg Phe Met 

195 200 205 

Thr Phe Arg Val Asp Ser Ser Arg Ala Ala Asp Glu Ser lie Phe Tyr 

210 215 220 

Met Val Gly Glu Tyr Asp Pro Ser Asp Trp Phe Asn Met Lys Met Thr 
225 230 235 240 

Val Gly lie Asn Ser Pro Leu Asn Phe Glu Thr Thr Gin Leu His lie 

245 250 255 

Phe Ser Val Thr Ala Ser Asp Ser Leu Pro Asn Asn His Thr Val Thr 

260 265 270 

Met Met Val Gin Val Glu Asn Val Glu Ser Arg Pro Pro Arg Trp Val 

275 280 285 

Glu lie Phe Ser Val Gin Gin Phe Asp Glu Lys Thr Asn Gin Ser Phe 

290 295 300 

Ser Leu Arg Ala He Asp Gly Asp Thr Gly He Asn Arg Ala He Asn 
305 310 315 320 

Tyr Thr Leu He Arg Asp Asp Ala Asp Asp Phe Phe Ser Leu Glu Val 

325 330 ^35 

He Glu Asp Gly Ala He Leu His Val Thr Glu He Asp Arg Asp Lys 

340 345 350 

Leu Glu Arg Glu Leu Phe Asn Leu Thr He Val Ala Tyr Lys Ser Thr 

355 360 365 

Asp Ala Ser Phe Ala Thr Glu Ala His He Phe He He Val Asn Asp 

370 375 380 

Val Asn Asp Gin Arg Pro Glu Pro Leu His Lys Glu Tyr Ser He Asp 
385 390 395 400 

He Met Glu Glu Thr Pro Met Thr Leu Asn Phe Asn Glu Glu Phe Gly 

405 410 415 

Phe His Asp Arg Asp Leu Gly Glu Asn Ala Gin Tyr Thr Val Glu Leu 

420 425 430 

Glu Asp Val Phe Pro Pro Gly Ala Ala Ser Ala Phe Tyr He Ala Pro 

435 440 445 

Gly Ser Gly Tyr Gin Arg Gin Thr Phe He Met Gly Thr He Asn His 

450 455 460 

Thr Met Leu Asp Tyr Glu Asp Val He Phe Gin Asn He lie He Lys 
465 470 475 480 

Val Lys Ala Val Asp Met Asn Asn Ala Ser His Val Gly Glu Ala Leu 

485 490 495 

Val Tyr Val Asn Leu He Asn Trp Asn Asp Glu Leu Pro He Phe Glu 

500 505 510 

Glu Ser Ser Tyr Ser Ala Ser Phe Lys Glu Thr Val Gly Ala Gly Phe 

515 520 525 

Pro Val Ala Thr Val Leu Ala Leu Asp Arg Asp He Asp Asp Val Val 

530 535 540 

Val His Ser Leu Met Gly Asn Ala Val Asp Tyr Leu Phe He Asp Glu 
545 550 555 560 

Ser Thr Gly Glu He Phe Val Ser Met Asp Asp Ala Phe Asp Tyr His 

565 570 575 

Arg Gin Asn Thr Leu Phe Val Gin Val Arg Ala Asp Asp Thr Leu Gly 

580 585 590 

Asp Gly Pro His Asn Thr Val Thr Thr Gin Leu Val He Glu Leu Glu 

595 600 605 

Asp Val Asn Asn Thr Pro Pro Thr Leu Arg Leu Pro Arg Ser Thr Pro 

610 615 620 

Ser Val Glu Glu Asn Val Pro Glu Gly Tyr Glu He Ser Arg Glu He 
625 630 635 640 

Thr Ala Thr Asp Pro Asp Thr Ser Ala Tyr Leu Trp Phe Glu He Asp 
645 650 655 
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Trp 


Asp 


Ser 


Thr 


Trp 


Ala 


Thr 


Lys 


Gin 


Gly 


Arg 


Glu 


Thr 


Asn 


Pro 


Thr 








660 










665 










670 






Glu 


Tyr 


Val 


Gly 


Cys 


lie 


Val 


He 


Glu 


Thr 


He 


Tyr 


Pro 


Thr 


Glu 


Gly 






675 










680 










685 








Asn 


Arg 


Gly 


Ser 


Ala 


He 


Gly 


Arg 


Leu 


Val 


Val 


Gin 


Glu 


He 


Arg 


Asp 




690 










695 










700 










Asn 


Val 


Thr 


He 


Asp 


Phe 


Glu 


Glu 


Phe 


Glu 


Met 


Leu 


Tyr 


Leu 


Thr 


Val 


705 










710 










715 










720 


Arg 


Val 


Arg 


Asp 


Leu 


Asn 


Thr 


Val 


He 


Gly 


Asp 


Asp 


Tyr 


Asp 


Glu 


Ala 










725 










730 










735 




Thr 


Phe 


Thr 


He 


Thr 


He 


He 


Asp 


Met 


Asn 


Asp 


Asn 


Ala 


Pro 


He 


Phe 








740 










745 










750 






Ala 


Asn 


Gly 


Thr 


Leu. 


Thr 


Gin 


Thr 


Met 


Arg 


Val 


Arg 


Glu 


Leu 


Ala 


Ala 






755 










760 










765 








Ser 


Gly 


Thr 


Leu 


He 


Gly 


Ser 


Val 


Leu 


Ala 


Thr 


Asp 


He 


Asp 


Gly 


Pro 




770 










775 










780 










Leu 


Tyr 


Asn 


Gin 


Val 


Arg 


Tyr 


Thr 


He 


Gin 


Pro 


Arg 


Asn 


Asn 


Thr 


Pro 


785 










790 










7 95 










800 


Glu Gly 


Leu 


Val 


Lys 


He 


Asp 


Phe 


Thr 


Thr 


Gly 


Gin 


He 


Glu 


Val 


Asp 










805 










810 










815 




Ala 


Asn 


Glu 


Ala 


He 


Asp 


Ala 


Asp 


Glu 


Pro 


Trp 


Arg 


Phe 


Tyr 


Leu 


Tyr 








820 










825 










830^ 






Tyr 


Thr 


Val 


He 


Ala 


Ser 


Asp 


Glu 


Cys 


Ser 


Leu 


Glu 


Asn 


Arg 


Thr 


Glu 






835 










840 










845 








Cys 


Pro 


Pro 


Asp 


Ser 


Asn 


Tyr 


Phe 


Glu 


Val 


Pro 


Gly 


Asp 


lie 


Glu 


He 




850 










855 










860 










Glu 


He 


He 


Asp 


Thr 


Asn 


Asn 


Lys 


Val 


Pro 


Glu 


Pro 


Leu 


Thr 


Glu 


Lys 


865 










870 










875 










880 


Phe 


Asn 


Thr 


Thr 


Val 


Tyr 


Val 


Trp 


Glu 


Asn 


Ala 


Thr 


Ser 


Gly 


Asp 


Glu 










885 










890 










895 




Val 


Val 


Gin 


Leu 


Tvr 


Ser 


His 


Asp 


Arg 


Asp 


Aro 


Asp 


Glu 


Leu 


Tvr 


His 








900 










905 










910 






Thr 


Val 


Arcr 


Tvr 


Thr 


Met 


Asn 


Phe 


Ala 


Val 


Asn 


Pro 


Arg 


Leu 


Arg 


Asp 






915 










920 










925 








Phe 


Phe 


Glu 


Val 


Asp 


Leu 


Asp 


Thr 


Gly 


Arg 


Leu 


Glu 


Val 


His 


Tyr 


Pro 




930 










935 










940 










Gly Asp 


Glu 


Lys 


Leu 


Asp 


Arg 


Asp 


Gly 


Asp 


Glu 


Pro 


Thr 


His 


Thr 


He 


945 










950 










955 










960 


Phe 


Val 


Asn 


Phe 


lie 


Asp 


Asn 


Phe 


Phe 


Ser 


Asp 


Gly 


Asp 


Gly 


Arg 


Arg 










965 










970 










975 




Asn 


Gin 


Asp 


Glu 


Val 


Glu 


lie 


Phe 


Val 


Val 


Leu 


Leu 


Asp 


Val 


Asn 


Asp 








980 










985 










990 






Asn 


Ala 


Pro 


Glu 


Met 


Pro 


Leu 


Pro 


Asp 


Glu 


Leu 


Arg 


Phe 


Asp 


Val 


Ser 






995 










1000 








1005 






Glu Gly 


Ala 


Val 


Ala 


Gly 


Val 


Arg 


Val 


Leu 


Pro 


Glu 


He 


Tyr 


Ala 


Pro 




1010 








1015 








1020 








Asp Arg Asp 


Glu 


Pro 


Asp 


Thr Asp Asn 


Ser 


Arg 


Val 


Gly Tyr 


Gly 


He 


1025 








1030 








1035 








1040 


Leu 


Asp 


Leu 


Thr 


He 


Thr 


Asp Arg 


Asp 


He 


Glu 


Val 


Pro 


Asp 


Leu 


Phe 










1045 








1050 








1055 


Thr 


Met 


He 


Ser 


He 


Glu 


Asn 


Lys 


Thr 


Gly 


Glu 


Leu 


Glu 


Thr 


Ala 


Met 








1060 








1065 








1070 




Asp 


Leu Arg 


Gly Tyr Trp Gly Thr Tyr Glu 


He 


Phe 


He 


Glu 


Ala 


Phe 






1075 








1080 








1085 




■ ) 


Asp His Gly 


Tyr 


Pro Gin Gin Arg 


Ser 


Asn 


Glu 


Thr 


Tyr 


Thr 


Leu 


Val 




1090 








1095 








1100 






He 


Arg 


Pro 


Tyr Asn 


Phe 


His 


His 


Pro 


Val 


Phe 


Val 


Phe 


Pro 


Gin Pro 


1105 








1110 








1115 








1120 


Asp 


Ser 


Val 


He Arg Leu Ser Arg Glu Arg 


Ala Thr Glu Gly Gly Val 










1125 








1130 








1135 


•Leu 


Ala 


Thr 


Ala 


Ala 


Asn 


Glu 


Phe 


Leu 


Glu 


Pro 


He Tyr Ala Thr Asp 



1140 1145 1150 
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Glu Asp Gly Leu His Ala Gly Ser Val Thr Phe His Val Gin Gly Asn 

1155 1160 1165 

Glu Glu Ala Val Gin Tyr Phe Asp He Thr Glu Val Gly Ala Gly Glu 

1170 1175 1180 

Asn Ser Gly Gin Leu He Leu Arg Gin Leu Phe Pro Glu Gin He Arg 
1185 1190 1195 1200 

Gin Phe Arg lie Thr He Arg Ala Thr Asp Gly Gly Thr Glu Pro Gly 

1205 1210 1215 

Pro Leu Trp Thr Asp Val Thr Phe Ser Val Val Phe Val Pro Thr Gin 

1220 1225 1230 

Gly Asp Pro Val Phe Ser Glu Asn Ala Ala Thr Val Ala Phe Phe Glu 

1235 1240 1245 

Gly Glu Glu Gly Leu Arg Glu Ser Phe Glu Leu Pro Gin Ala Glu Asp 

1250 1255 1260 

Leu Lys Asn His Leu Cys Glu Asp Asp Cys Gin Asp He Tyr Tyr Arg 
1265 1270 1275 1280 

Phe He Asp Gly Asn Asn Glu Gly Leu Phe Val Leu Asp Gin Ser Ser 

1285 1290 1295 

Asn Val He Ser Leu Ala Gin Glu Leu Asp Arg Glu Val Ala Thr Ser 

1300 1305 1310 

Tyr Thr Leu His He Ala Ala Ser Asn Ser Pro Asp Ala Thr Gly He 

1315 1320 1325 - - 

Pro Leu Gin Thr Ser He Leu Val Vai Thr Val Asn Val Arg Glu Ala 

1330 1335 1340 

Asn Pro Arg Pro He Phe Glu Gin Asp Leu Tyr Thr Ala Gly He Ser 
1345 1350 1355 1360 

Thr Leu Asp Ser He Gly Arg Glu Leu Leu Thr Val Arg Ala Ser His 

1365 1370 1375 

Thr Glu Asp Asp Thr He Thr Tyr Thr He Asp Arg Ala Ser Met Gin 

1380 1385 1390 

Leu Asp Ser Ser Leu Glu Ala Val Arg Asp Ser Ala Phe Ala Leu His 

1395 1400 1405 

Ala Thr Thr Gly Val Leu Ser Leu Asn Met Gin Pro Thr Ala Ser Met 

1410 1415 1420 

His Gly Met Phe Glu Phe Asp Val He Ala Thr Asp Thr Ala Ser Ala 
1425 1430 1435 1440 

He Asp Thr Ala Arg Val Lys Val Tyr Leu He Ser Ser Gin Asn Arg 

1445 1450 1455 

Val Thr Phe He Phe Asp Asn Gin Leu Glu Thr Val Glu Gin Asn Arg 

1460 1465 1470 

Asn Phe He Ala Ala Thr Phe Ser Thr Gly Phe Asn Met Thr Cys Asn 

1475 1480 1485 

He Asp Gin Val Val Pro Phe Ser Asp Ser Ser Gly Val Ala Gin Asp 

1490 1495 1500 

Asp Thr Thr Glu Val Arg Ala His Phe He Arg Asp Asn Val Pro Val 
1505 1510 1515 1520 

Gin Ala Gin Glu Val Glu Ala Val Arg Ser Asp Thr Val Leu Leu Arg 

1525 1530 1535 

Thr He Gin Leu Met Leu Ser Thr Asn Ser Leu Val Leu Gin Asp Leu 

1540 1545 1550 

Val Thr Gly Asp Thr Pro Thr Leu Gly Glu Glu Ser Met Gin He Ala 

1555 1560 1565 

Val Tyr Ala Leu Ala Ala Leu Ser Ala Val Leu Gly Phe Leu Cys Leu 

1570 1575 1580 

Val Leu Leu Leu Ala Leu Phe Cys Arg Thr Arg Ala Leu Asn Arg Gin 
1585 1590 1595 1600 

Leu Gin Ala Leu Ser Met Thr Lys Tyr Gly Ser Val Asp Ser Gly Leu 

1605 1610 1615 

Asn Arg Ala Gly Leu Ala Pro Gly Thr Asn Lys His Ala Val Glu Gly 

1620 1625 1630 

Ser Asn Pro Met Trp Asn Glu Ala He Arg Ala Pro Asp Phe Asp Ala 
1635 1640 1645 
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lie 


Ser 


Asp 


Ala 


Ser Gly Asp 


Ser 


Asp 


Leu lie Gly 


He Glu 


Asp 


Met 




1650 




1655 




1660 






Pro 


Gin 


Phe 


Arg 


Asp Asp Tyr 


Phe 


Pro 


Pro Gly Asp 


Thr Asp 


Ser 


Ser 


1665 






1670 






1675 






1680 


Ser 


Gly 


lie 


Val 


Leu His Met 


Gly 


Glu 


Ala Thr Asp 


Asn Lys 


Pro 


Val 










1685 






1690 




1695 


Thr 


Thr 


His 


Gly 


Asn Asn Phe 


Gly 


Phe 


Lys Ser Thr 


Pro Tyr 


Leu 


Pro 








1700 




1705 


1710 




Gin 


Pro 


His 


Pro 


Lys 















1715 



<210> 3 
<211> 5527 
<212> DNA 

<213> Heliothis zea 



<220> 
<221> CDS 

<222> (171) ... (5360) 
<400> 3 

gtggattgtt gttctaaaaa cagaaaaaaa acgcagtttg aaaaaagtta tttttgtgat 60 
atttgtgtaa agtgtagtgt taaataattt ggcattgctg taaaggatta aagagtgttc 120 
caattgatca cccagaggtg gatcgaccag actagacaca gaactatgag atg gca 176 

Met Ala 
1 

gtc gac gtg aga ata ttg acg gca gcg gtt ttc att ate get get cac 224 
Val Asp Val Arg He Leu Thr Ala Ala Val Phe lie He Ala Ala His 
5 10 15 

ttg act ttc gcg caa gat tgt age tac atg gta gca ata ccc aga cca 272 
Leu Thr Phe Ala Gin Asp Cys Ser Tyr Met Val Ala lie Pro Arg Pro 
20 25 30 

gag cga cca gat ttt cca agt eta aat ttc gat gga ata cca tgg agt 320 
Glu Arg Pro Asp Phe Pro Ser Leu Asn Phe Asp Gly He Pro Trp Ser 
35 40 45 50 

egg tat ccc ctg ata cca gtg gag ggt aga gaa gat gtg tgc atg aac 368 
Arg Tyr Pro Leu lie Pro Val Glu Giy Arg Glu Asp Val Cys Met Asn 
55 60 65 

gaa ttc cag cca gat gee ttg aac cca gtt ace gtc ate ttc atg gag 416 
Glu Phe Gin Pro Asp Ala Leu Asn Pro Val Thr Val He Phe Met Glu 
70 75 80 

gag gag ata gaa ggg gat gtg get ate gcg agg ctt aac tac cga ggt 464 
Glu Glu He Glu Gly Asp Val Ala He Ala Arg Leu Asn Tyr Arg Gly 
85 90 95 

acc aat act ccg ace att gta tct cca ttt age ttt ggt act ttt aac 512 
Thr Asn Thr Pro Thr He Val Ser Pro Phe Ser Phe Gly Thr Phe Asn 
100 105 110 

atg ttg ggg ccg gtc ata cgt aga ata cct gag aat ggt ggc gac tgg 560 
Met Leu Gly Pro Val He Arg Arg He Pro Glu Asn Gly Gly Asp Trp 
115 120 125 130 

cat etc gtc att aca cag aga cag gac tac gag acg cca ggt atg cag 608 
His Leu Val He Thr Gin Arg Gin Asp Tyr Glu Thr Pro Gly Met Gin 
135 140 145 
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cag tac ate ttc gac gtg agg gta gac gat gaa ccg eta gtg gec acg - - ~—€SS 
Gin Tyr lie Phe Asp Val Arg Val Asp Asp Glu Pro Leu Val Ala Thr 
150 155 160 

gtg atg ctg etc att gtc aac ate gat gac aac gat cct ate ata cag 704 
Val Met Leu Leu lie Val Asn lie Asp Asp Asn Asp Pro lie lie Gin 
165 170 175 

atg ttt gag cct tgt gat att cct gaa cgc ggt gaa aca ggc ate aca 752 
Met Phe Glu Pro Cys Asp lie Pro Glu Arg Gly Glu Thr Gly lie Thr 
180 185 " 190 

tea tgc aag tac ace gtg age gat get gac ggc gag ate agt aca cgt 800 
Ser Cys Lys Tyr Thr Val Ser Asp Ala Asp Gly Glu lie Ser Thr Arg 
195 200 205 210 

ttc atg agg ttc gaa ate age age gat cga gac gat gac gaa tat ttc " 848 

Phe Met Arg Phe Glu lie Ser Ser Asp Arg Asp Asp Asp Glu Tyr Phe 
215 220 225 

gaa etc gtc aga gaa aat ata caa gga caa tgg atg tat gtt tat atg 896 
Glu Leu Val Arg Glu Asn lie Gin Gly Gin Trp Met Tyr Val His Met 
230 235 240 

aga gtt cac gtc aaa aaa cct ctt gat tat gag gaa aac ccg eta cat 94 4 

Arg Val His Val Lys Lys Pro Leu Asp Tyr Glu Glu Asn Pro Leu His 
245 250 255 

ttg ttt aga gtt aca get tat gat tec eta cca aac aca cat aca gtg 992 
Leu Phe Arg Val Thr Ala Tyr Asp Ser Leu Pro Asn Thr His Thr Val 
260. 265 270 

acg atg atg gtg caa gta gag aac gtt gag aac aga ccg ccg cga tgg 1040 
Thr Met Met Val Gin Val Glu Asn Val Glu Asn Arg Pro Pro Arg Trp 
275 280 285 290 

atg gag ata ttt get gtc cag cag ttc gat gag aag acg gaa caa tec 1088 
Met Glu lie Phe Ala Val Gin Gin Phe Asp Glu Lys Thr Glu Gin Ser 
295 ,300 305 

ttt agg gtt cga gee ate gat gga gat acg gga ate gat aaa cct att 1136 
Phe Arg Val Arg Ala lie Asp Gly Asp Thr Gly lie Asp Lys Pro He 
310 315 320 

ttc tat agg ate gaa act gaa aaa gga gag gaa gac ttg ttc age att 1184 
Phe Tyr Arg He Glu Thr Glu Lys Gly Glu Glu Asp Leu Phe Ser lie 
325 330 335 

caa acg ata gaa ggt ggt cga gaa ggc get tgg ttt aac gtc get cca 1232 
Gin Thr He Glu Gly Gly Arg Glu Gly Ala Trp Phe Asn Val Ala Pro 
340 345 350 

ata gac agg gac act eta gag aag gaa gtt ttc cac gtg tec ata ata 1280 
He Asp Arg Asp Thr Leu Glu Lys Glu Val Phe His Val Ser He He 
355 360 365 370 

gcg tac aaa tat ggc gat aat gac gtg gaa ggc agt teg tea ttc cag 1328 
Ala Tyr Lys Tyr Gly Asp Asn Asp Val Glu Gly Ser Ser Ser Phe Gin 
375 380 385 

teg aaa ace gat gtg gtc ate ate gtg aac gat gtc aat gat cag gcg 1376 
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Ser Lys Thr Asp Val Val lie lie Val Asn Asp Val Asn Asp Gin Ala 

390 395 400 - - ' 

ccg ctt cct ttc egg gaa gag tac tec att gaa att atg gag gaa act 1424 
Pro Leu Pro Phe Arg Glu Glu Tyr Ser He Glu He Met Glu Glu Thr 
405 410 415 

gcg atg ace ctg aat tta gaa gac ttt ggg ttc cat gat aga gat etc 1472 
Ala Met Thr Leu Asn Leu Glu Asp Phe Gly Phe His Asp Arg Asp Leu 
420 425 430 

ggt cct cac gca caa tac aca gta cac tta gag age ate cat cct ccc 1520 
Gly Pro His Ala Gin Tyr Thr Val His Leu Glu Ser lie His Pro Pro 
435 " 440 445 450 

cga get cac gag gcg ttc tac ata gca ccg gag gtt ggc tac cag cgc 1568 
Arg Ala His Glu Ala Phe Tyr He Ala Pro Glu Val Gly Tyr Gin Arg 
455 460 465 

cag tec ttc att atg ggc acg cag aac cat cac atg ctg gac ttc gaa 1616 
Gin Ser Phe He Met Gly Thr Gin Asn His His Met Leu Asp Phe Glu 
470 475 480^ 

gtg cca gag ttc cag aat ata caa ctg agg gee gta gcg ata gac atg 1664 
Val Pro Glu Phe Gin Asn He Gin Leu Arg Ala Val Ala He Asp Met 
485 490 495 

gac gat ccc aaa tgg gtg ggt ate gcg ata ate aac att aaa ctg ate 1712 
Asp Asp Pro Lys Trp Val Gly He Ala He lie Asn He Lys Leu He 
500 505 510 

aac tgg aac gat gag ctg ccg atg ttc gag agt gac gtg caa act gtc 1760 
Asn Trp Asn Asp Glu Leu Pro Met Phe Glu Ser Asp Val Gin Thr Val 
515 520 525 530 

age ttc gat gag aca gag ggc gca ggc ttc tat gtg gee act gtt gtg 1808 
Ser Phe Asp Glu Thr Glu Gly Ala Gly Phe Tyr Val Ala Thr Val Val 
535 540 545 

gcg aag gac egg gat gtt ggt gat aaa. gtc gaa cac tct eta atg ggt 1856 
Ala Lys Asp Arg Asp Val Gly Asp Lys Val Glu His Ser Leu Met Gly 
550 555 560 

aac gca gta age tac ctg agg ate gac aag gaa ace ggc gag ata ttc 1904 
Asn Ala Val Ser Tyr Leu Arg He Asp Lys Glu Thr Gly Glu He Phe 
565 570 575 

gtc aca gaa aac gaa gca ttc aac tat cac agg cag aac gaa etc ttt 1952 
Val Thr Glu Asn Glu Ala Phe Asn Tyr His Arg Gin Asn Glu Leu Phe 
580 585 590 

gtg cag ata cca get gac gac acg ctg ggc gag cct tac aac ace aac 2000 
Val Gin He Pro Ala Asp Asp Thr Leu Gly Glu Pro Tyr Asn Thr Asn 
595 600 605 610 



act act cag ttg gtg ate aag ctg egg gac att aac aac ace cct cct 
Thr Thr Gin Leu Val He Lys Leu Arg Asp He Asn Asn Thr Pro Pro 
615 620 625 



2048 



acg etc agg ctg cct cgc gee act cca tea gtg gaa gag aac gtg ccc 2096 
Thr Leu Arg Leu Pro Arg Ala Thr Pro Ser Val Glu Glu Asn Val Pro 
630 635 640 
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gac ggg ttt gtg ate ccc acg cag ctg cac gec acg gac ccc gac act 
Asp Gly Phe Val lie Pro Thr Gin Leu His Ala Thr Asp Pro Asp Thr 
645 650 655 



aca get gag ctg cgc ttc gag ate gac tgg cag aac teg tat get ace 
Thr Ala Glu Leu Arg Phe Glu lie Asp Trp Gin Asn Ser Tyr Ala Thr 
660 . 665 670 

aag cag gga egg aat act gac tct aag gag tat ate ggt tgt ata gaa 
Lys Gin Gly Arg Asn Thr Asp Ser Lys Glu Tyr lie Gly Cys lie Glu 
675 680 685 690 

ate gag acg ata tac ccg aat ata aac cag cga ggc aac gee ate ggc 
lie Glu Thr lie Tyr Pro Asn lie Asn Gin Arg Gly Asn Ala lie Gly 
695 700 705 

cgc gtg gta gtg cga gag ate egg gac ggc gtc acc ata gac tat gag 
Arg Val Val Val Arg Glu lie Arg Asp Gly Val Thr He Asp Tyr Glu 
710 715 720 

atg ttt gaa gtt eta tac etc acc gtc att gtg agg gat ctc^aac acc 
Met Phe Glu Val Leu Tyr Leu Thr Val He Val Arg Asp Leu Asn Thr 
725 730 735 

gtt att gga gaa gac cat gat ata tec aca ttc acg ate acg ata ata 
Val He Gly Glu Asp His Asp lie Ser Thr Phe Thr lie Thr lie He 
740 745 750 

gac atg aac gac aac cct ccc ctg tgg gtg gaa ggc acc ctg acg caa 
Asp Met Asn Asp Asn Pro Pro Leu Trp Val Glu Gly Thr Leu Thr Gin 
755 760 765 770 



gag ttc cgt gtg cga gag gtg gca gee tea gga gtt gtt ata gga tec 

Glu Phe Arg Val Arg Glu Val Ala Ala Ser Gly Val Val He Gly Ser 
775 780 785 

gta ctg gee act gat ate gac gga ccg ctg tat aat caa gtg egg tat 

Val Leu Ala Thr Asp He Asp Gly Pro Leu Tyr Asn Gin Val Arg Tyr 
790 795 800 

act att act ccc aga eta gac act cca gaa gac eta gtg gac ata gac 

Thr He Thr Pro Arg Leu Asp Thr Pro Glu Asp Le\i Val Asp lie Asp 

805 810 815 

ttc aac acg ggt cag ate tec gta aag tta cac cag get ata gac gcg 

Phe Asn Thr Gly Gin He Ser Val Lys Leu His Gin Ala He Asp Ala 
820 825 830 

gac gag ccg ccg cgt cag aac etc tac tac acc gtc ata get agt gac 

Asp Glu Pro Pro Arg Gin Asn Leu Tyr Tyr Thr Val He Ala Ser Asp 
835 840 845 850 

aag tgt gac etc ctt act gtc act gag tgt ccg cct gac cct act tac 

Lys Cys Asp Leu Leu Thr Val Thr Glu Cys Pro Pro Asp Pro Thr Tyr 
855 860 865 

ttt gag aca ccg gga gag att acc ate cac ata acg gac acg aac aac 

Phe Glu Thr Pro Gly Glu He Thr lie His He Thr Asp Thr Asn Asn 
870 875 880 

aag gtg cct caa gtg gaa gac gac aag ttc gag gcg acg gtg tac ate 
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Lys Val Pro Gin Val Glu Asp Asp Lys Phe Glu Ala Thr Val Tyr Tie 
885 890 895 



tac gag ggc gcg gac gat gga caa cat gtc gtg cag ate tac gec age 
Tyr Glu Gly Ala Asp Asp Gly Gin His Val Val Gin lie Tyr Ala Ser 
900 905 910 



2912 



gat ctg gat aga gat gaa ate tac cac aaa gtg age tac cag ate aac 
Asp Leu Asp Arg Asp Glu lie Tyr His Lys Val Ser Tyr Gin lie Asn 
915 920 925 930 



2960 



tac gcg ate aac tct cgt etc cgc gac ttc ttc gag atg gac ctg gag 
Tyr Ala lie Asn Ser Arg Leu Arg Asp Phe Phe Glu Met Asp Leu Glu 
935 94 0 945 



3008 



tec ggc etc gtg tac gtc aac aac ace gee ggc gag ctg ctg gac agg 
Ser Gly Leu Val Tyr Val Asn Asn Thr Ala Gly Glu Leu Leu Asp Arg 
950 955 960 



3056 



gac ggc gac gag ccc aca cat cgc ate ttc ttc aat gtc ate gat aac 
Asp Gly Asp Glu Pro Thr His Arg lie Phe Phe Asn Val He Asp Asn 
965 970 975 ' *■ 



3104 



ttc tat gga gaa gga gat ggc aac cgc aat cag aac gag aca caa gtg 
Phe Tyr Gly Glu Gly Asp Gly Asn Arg Asn Gin Asn Glu Thr Gin Val 
980 985 990 



3152 



tta gta gta ttg ctg gac ate aat gac aac tat ccg gaa ctg cct gaa 
Leu Val Val Leu Leu Asp He Asn Asp Asn Tyr Pro Glu Leu Pro Glu 
995 1000 1005 1010 



3200 



act ate cca tgg get ate tct gag age tta gag ctg ggt gag cgt gta 
Thr lie Pro Trp Ala He Ser Glu Ser Leu Glu Leu Gly Glu Arg Val 
1015 1020 1025 



3248 



cag cca gaa ate ttt gec egg gac cgc gac gaa ccc gga aca gac aac 
Gin Pro Glu He Phe Ala Arg Asp Arg Asp Glu Pro Gly Thr Asp Asn 
1030 1035 1040 



3296 



tec cgc gtc gec tat gee ate aca ggc etc gec age act gac egg gac 
Ser Arg Val Ala Tyr Ala He Thr Gly Leu Ala Ser Thr Asp Arg Asp 
1045 1050 1055 



3344 



ata caa gtg cct aat etc ttc aac atg ate act ata gag agg gac agg 
He Gin Val Pro Asn Leu Phe Asn Met He Thr He Glu Arg Asp Arg 
1060 1065 1070 



3392 



gga att gat cag aca gga ata ctt gag gca get atg gat ttg aga ggc 
Gly He Asp Gin Thr Gly He Leu Glu Ala Ala Met Asp Leu Arg Gly 
1075 1080 1085 1090 



3440 



tat tgg ggc ace tat caa ata gat att cag gcg tat gac cat gga ata 
Tyr Trp Gly Thr Tyr Gin He Asp He Gin Ala Tyr Asp His Gly He 
1095 1100 1105 



3488 



cct caa agg att tea aat cag aag tac ccg ctg gtg_ att aga cct tac 
Pro Gin Arg He Ser Asn Gin Lys Tyr Pro Leu Val He Arg Pro Tyr 
1110 1115 1120 



3536 



aac ttc cac gac cca gtg ttc gtg ttc cct caa cct gga tec act ate 
Asn Phe His Asp Pro Val Phe Val Phe Pro Gin Pro Gly Ser Thr He 
1125 1130 « . 1135 
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aga ctg gca aag gag cga gca gta gtc aac ggt ata ctg get aca gta 
Arg Leu Ala Lys Glu Arg Ala Val Val Asn Gly lie Leu Ala Thr Val 
1140 1145 1150 



3-632 



gac ggc gaa ttt ctg gac aga ate gtt gec acc gac gag gat ggt tta 
Asp Gly Glu Phe Leu Asp Arg He Val Ala Thr Asp Glu Asp Gly Leu 
1155 1160 1165 1170 



3680 



gaa get gga ctt gtc aca ttc tct ate gee gga gat gat gaa gat get 
Glu Ala Gly Leu Val Thr Phe Ser He Ala Gly Asp Asp Glu Asp Ala 
1175 1180 1185 



3728 



cag ttc ttc gac gtg ttg aac gac gga gtg aac teg ggt get etc acc 
Gin Phe Phe Asp Val Leu Asn Asp Gly Val Asn Ser Gly Ala Leu Thr 
1190 1195 1200 



3776 



etc acg egg etc ttc cct gaa gag ttc cga gag ttc cag gtg acg att 
Leu Thr Arg Leu Phe Pro Glu Glu Phe Arg Glu Phe Gin Val Thr He 
1205 1210 1215 



3824 



cgt get acg gac ggt gga act gag cct ggt cca agg agt acg ^gac tgc 
Arg Ala Thr Asp Gly Gly Thr Glu Pro Gly Pro Arg Ser Thr Asp Cys 
1220 1225 1230 



3872 



ttg gtg acc gta gtg ttt gta ccc acg cag gga gag ccc gtg ttc gag 
Leu Val Thr Val Val Phe Val Pro Thr Gin Gly Glu Pro Val Phe Glu 
1235 1240 1245 1250 



3920 



gat agg act tac acg gtt get ttt gtt gaa aaa gat gag ggt atg tta 
Asp Arg Thr Tyr Thr Val Ala Phe Val Glu Lys Asp Glu Gly Met Leu 
1255 1260 1265 



3968 



gag gag gcg gaa eta cct cgc gee tea gac cca agg aac ate atg tgt 
Glu Glu Ala Glu Leu Pro Arg Ala Ser Asp Pro Arg Asn He Met Cys 
1270 1275 1280 



4016 



gaa gat gat tgt cac gac acc tat tac age att gtt gga ggc aat teg 
Glu Asp Asp Cys His Asp Thr Tyr Tyr Ser He Val Gly Gly Asn Ser 
1285 1290 1295 



4064 



ggt gaa cac ttc aca gta gac cct cgt acc aac gtg eta tec ctg gtg 
Gly Glu His Phe Thr Val Asp Pro Arg Thr Asn Val Leu Ser Leu Val 
1300 1305 1310 



4112 



aag ccg ctg gac cgc tec gaa cag gag aca cac acc etc ate att gga 
Lys Pro Leu Asp Arg Ser Glu Gin Glu Thr His Thr Leu He He Gly 
1315 1320 1325 1330 



4160 



gee age gac act ccc aac ccg gec gee gtc ctg cag get tct aca etc 
Ala Ser Asp Thr Pro Asn Pro Ala Ala Val Leu Gin Ala Ser Thr Leu 
1335 1340 1345 



4208 



act gtc act gtt aat gtt cga gaa gcg aac ccg cga cca gtg ttc caa 
Thr Val Thr Val Asn Val Arg Glu Ala Asn Pro Arg Pro Val Phe Gin 
1350 1355 1360 



4256 



aga gca etc tac aca get ggc ate tct get ggc gat ttc ate gaa aga 
Arg Ala Leu Tyr Thr Ala Gly He Ser Ala Gly Asp Phe He Glu Arg 
1365 1370 1375 



4304 



aat ctg ctg act tta gta gcg aca cat tea gaa gat ctg ccc ate act 
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Asn Leu Leu Thr Leu Val Ala Thr His Ser Glu Asp Leu Pro lie Thr 

1380 1385 1390 _ - - 

tac act ctg ata caa gag tec atg gaa gca gac ccc aca etc gaa get 4 400 

Tyr Thr Leu lie Gin Glu Ser Met Glu Ala Asp Pro Thr Leu Glu Ala 
1395 1400 1405 1410 

gtt cag gag tea gec ttc ate etc aac cct gag act gga gtc ctg tec 4 448 

Val Gin Glu Ser Ala Phe lie Leu Asn Pro Glu Thr Gly Val Leu Ser 
1415 1420 1425 

etc aac ttc cag cca ace gee tec atg cac ggc atg ttc gag ttc gaa 4496 
Leu Asn Phe Gin Pro Thr Ala Ser Met His Gly Met Phe Glu Phe Glu 
1430 1435 1440 

gtc aaa gee act gat tea agg aca gaa act gee cgc acg gaa gtg aag 4 544 

Val Lys Ala Thr Asp Ser Arg Thr Glu Thr Ala Arg Thr Glu Val Lys 
1445 1450 1455 

gtg tac ctg ata tea gac cgc aac cga gtg ttc ttc acg ttc aat aac 4592 
Val Tyr Leu lie Ser Asp Arg Asn Arg Val Phe Phe Thr Phe Asn Asn 
1460 1465 1470 

cca ctg cct gaa gtc aca ccc cag gaa gat ttc ata gcg gag acg ttc 4 640 

Pro Leu Pro Glu Val Thr Pro Gin Glu Asp Phe lie Ala Glu Thr Phe 
1475 1480 1485 1490 

C 

acg gca ttc ttc ggc atg acg tgc aac ate gac cag teg tgg tgg gec 4 688 

Thr Ala Phe Phe Gly Met Thr Cys Asn lie Asp Gin Ser Trp Trp Ala 
1495 1500 1505 

age gat ccc gtc acc ggc gee acc aag gac gac cag act gaa gtc agg 4 736 

Ser Asp Pro Val Thr Gly Ala Thr Lys Asp Asp Gin Thr Glu Val Arg 
1510 1515 1520 

get cat ttc ate agg gac gac ctt ccc gtg cct get gag gag att gaa 4784 
Ala His Phe lie Arg Asp Asp Leu Pro Val Pro Ala Glu Glu lie Glu 
1525 1530 1535 

cag tta cgc ggt aac cca act eta gU aat age ate caa cga gee ctg 4832 
Gin Leu Arg Gly Asn Pro Thr Leu Val Asn Ser lie Gin Arg Ala Leu 
1540 1545 1550 

gag gaa cag aac ctg cag eta gee gac ctg ttc acg ggc gag acg ccc 4880 
Glu Glu Gin Asn Leu Gin Leu Ala Asp Leu Phe Thr Gly Glu Thr Pro 
1555 1560 1565 } 1570 

ate etc ggc ggc gac gcg cag get cga gee ctg tac gcg ctg gcg gcg 4 928 

lie Leu Gly Gly Asp Ala Gin Ala Arg Ala Leu Tyr Ala Leu Ala Ala 
1575 1580 1585 

gtg gcg gcg gca etc gcg ctg att gtt gtt gtg ctg ctg att gtg ttc 4 976 

Val Ala Ala Ala Leu Ala Leu lie Val Val Val Leu Leu lie Val Phe 
1590 1595 1600 

ttt.gtt agg act agg act ctg aac egg cgc ttg caa get ctg tec atg 5024 
Phe Val Arg Thr Arg Thr Leu Asn Arg Arg Leu Gin Ala Leu Ser Met 
1605 1610 1615 

acc aag tac agt teg caa gac tct ggg ttg aac cgc gtg ggt ttg gcg 5072 
Thr Lys Tyr Ser Ser Gin Asp Ser Gly Leu Asn Arg Val Gly Leu Ala 
1620 1625 1630 
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gcg ccg ggc ace aat aag cac get gtc gag ggc tec aac ccc ate tgg _ „ ;5>20 
Ala Pro Gly Thr Asn Lys His Ala Val Glu Gly Ser Asn Pro He Trp 
1635 1640 1645 1650 

aat gaa acg ttg aag get ccg gac ttt gac get ctt age gag cag teg 5168 
Asn Glu Thr Leu Lys Ala Pro Asp Phe Asp Ala Leu Ser Glu Gin Ser 
1655 1660 1665 

tac gac tea gac eta ate ggc ate gaa gac ttg ccg cag ttc agg aac 5216 
Tyr Asp Ser Asp Leu He Gly He Glu Asp Leu Pro Gin Phe Arg Asn 
1670 1675 1680 



gac tac ttc cca cct gag gag ggc age tec atg cga gga gtc gtc aat 5264 
Asp Tyr Phe Pro Pro Glu Glu Gly Ser Ser Met Arg Gly Val Val Asn 
1685 1690 1695 

gaa cac gtg cct gaa tea ata gca aac cat aac aac aac ttc ggg ttt -5312 
Glu His Val Pro Glu Ser He Ala Asn His Asn Asn Asn Phe Gly Phe 
1700 1705 1710 

aac tct act ccc ttc age cca gag ttc gcg aac acg cag tfce^aga aga 5360 
Asn Ser Thr Pro Phe Ser Pro Glu Phe Ala Asn Thr Gin Phe Arg Arg 
1715 1720 1725 1730 



taaaatatta aagcatttta aattataata ttatgtaccg gtgaaatacc atacttatat 5420 
ttacctaagt atatattaaa gtgagattaa gtaagatact cgtattaatt aagagcattt 54 80 
atttttttaa atacaaaaca attaaactaa aaaaaaaaaa aaaaaaa 5527 

<210> 4 
<211> 1730 
<212> PRT 

<213> Heliothis zea 



<400> 4 



Met 


Ala 


Val 


Asp 


Val 


Arg 


He 


Leu 


Thr 


Ala 


Ala 


Val 


Phe 


He 


He 


Ala 


1 








5 










10 










15 




Ala 


His 


Leu 


Thr 
20 


Phe 


Ala 


Gin 


Asp 


Cys 
25 


Ser 


Tyr 


Met 


Val 


Ala 
30 


He 


Pro 


Arg 


Pro 


Glu 

35 


Arg 


Pro 


Asp 


Phe 


Pro 
40 


Ser 


Leu 


Asn 


Phe 


Asp 
45 


Gly 


He 


Pro 


Trp 


Ser^Arg 


Tyr 


Pro 


Leu 


He 


Pro 


Val 


Glu 


Gly 


Arg 


Glu 


Asp 


Val 


Cys 




50 










55 










60. 










Met 


Asn 


Glu 


Phe 


Gin 


Pro 


Asp 


Ala 


Leu 


Asn 


Pro 


Val 


Thr 


Val 


He 


Phe 


.65 










70 










75 










80 


Met 


Glu 


Glu 


Glu 


He 
85 


Glu 


Gly 


Asp 


Val 


Ala 
90 


He 


Ala 


Arg 


Leu 


Asn 
95 


Tyr 


Arg 


Gly Thr 


Asn 


Thr 


Pro 


Thr 


He 


Val 


Ser 


Pro 


Phe 


Ser 


Phe 


Gly 


Thr 








100 










105 










110 






Phe 


Asn 


Met 
115 


Leu 


Gly 


Pro 


Val 


He 
120 


Arg 


Arg 


He 


Pro 


Glu 
125 


Asn 


Gly 


Gly 


Asp 


Trp 
130 


His 


Leu 


Val 


He 


Thr 
135 


Gin 


Arg 


Gin 


Asp 


Tyr 
140 


Glu 


Thr 


Pro 


Gly 


Met 


Gin 


Gin 


Tyr 


He 


Phe 


Asp 


Val 


Arg 


Val 


Asp 


Asp 


Glu 


Pro 


Leu 


Val 


145 










150 










155 










160 


Ala 


Thr 


Val 


Met 


Leu 
165 


Leu 


He 


Val 


Asn 


He 
170 


Asp 


Asp 


Asn 


Asp 


Pro 
175 


He 


He 


Gin 


Met 


Phe 
180 


Glu 


Pro 


Cys 


Asp 


He 
185 


Pro 


Glu 


Arg 


Gly 


Glu 
190 


Thr 


Gly 


He 


Thr 


Ser 
195 


Cys 


Lys 


Tyr 


Thr 


Val 
200 


Ser 


Asp 


Ala 


Asp 


Gly 
205 


Glu 


He 


Ser 


Thr 


Arg 


Phe 


Met 


Arg 


Phe 


Glu 


He 


Ser 


Ser 


Asp 


Arg 


Asp 


Asp 


Asp 


Glu 
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210 215 220 

Tyr Phe Glu Leu Val Arg Glu Asn lie Gin Gly Gin Trp Met Tyr Val . 
225 230 235 240 

His Met Arg Val His Val Lys Lys Pro Leu Asp Tyr Glu Glu Asn Pro 

245 250 255 

Leu His Leu Phe Arg Val Thr Ala Tyr Asp Ser Leu Pro Asn Thr His 

260 265 270 

Thr Val Thr Met Met Val Gin Val Glu Asn Val Glu Asn Arg Pro Pro 

275 280 285 

Arg Trp Met Glu lie Phe Ala Val Gin Gin Phe Asp Glu Lys Thr Glu 

290 295 300 

Gin Ser Phe Arg Val Arg Ala lie Asp Gly Asp Thr Gly lie Asp Lys 
305 310 315 320 

Pro lie Phe Tyr Arg lie Glu Thr Glu Lys Gly Glu Glu Asp Leu Phe 

325 330 335 

Ser lie Gin Thr lie Glu Gly Gly Arg Glu Gly Ala Trp Phe Asn Val 

340 345 350 

Ala Pro lie Asp Arg Asp Thr Leu Glu Lys Glu Val Phe His Val Ser 

355 360 365 

lie lie Ala Tyr Lys Tyr Gly Asp Asn Asp Val Glu Gly Ser Ser Ser 

370 375 380 

Phe Gin Ser Lys Thr Asp Val Val lie He Val Asn Asp Val^Asn Asp 
385 390 395 400 

Gin Ala Pro Leu Pro Phe Arg Glu Glu Tyr Ser He Glu lie Met Glu 

405 410 415 

Glu Thr Ala Met Thr Leu Asn Leu Glu Asp Phe Gly Phe His Asp Arg 

420 425 430 

Asp Leu Gly Pro His Ala Gin Tyr Thr Val His Leu Glu Sef lie His 

435 440 445 

Pro Pro Arg Ala His Glu Ala Phe Tyr He Ala Pro Glu Val Gly Tyr 

450 455 460 

Gin Arg Gin Ser Phe He Met Gly Thr Gin Asn His His Met Leu Asp 
465 470 475 480 

Phe Glu Val Pro Glu Phe Gin Asn He Gin Leu Arg Ala Val Ala He 

485 490 495 

Asp Met Asp Asp Pro Lys Trp Val Gly lie Ala He He Asn He Lys 

500 505 510 

Leu He Asn Trp Asn Asp Glu Leu Pro Met Phe Glu Ser Asp Val Gin 

515 520 525 

Thr Val Ser Phe Asp Glu Thr Glu Gly Ala Gly Phe Tyr Val Ala Thr 

530 535 540 

Val Val Ala Lys Asp Arg Asp Val Gly Asp Lys Val Glu His Ser Leu 
545 550 555 560 

Met Gly Asn Ala Val Ser Tyr Leu Arg He Asp Lys Glu Thr Gly Glu 

565 570 575 

He Phe Val Thr Glu Asn Glu Ala Phe Asn Tyr His Arg Gin Asn Glu 

580 585 590 

Leu Phe Val Gin He Pro Ala Asp Asp Thr Leu Gly Glu Pro Tyr Asn 

595 600 605 

Thr Asn Thr Thr Gin Leu Val He Lys Leu Arg Asp He Asn Asn Thr 

610 615 620 

Pro Pro Thr Leu Arg Leu Pro Arg Ala Thr Pro Ser Val Glu Glu Asn 
625 630 635 640 

Val Pro Asp Gly Phe Val He Pro Thr Gin Leu His Ala Thr Asp Pro 

64 5 650 655 

Asp Thr Thr Ala Glu Leu Arg Phe Glu He Asp Trp Gin Asn Ser Tyr 

660 665 ' 670 

Ala Thr Lys Gin Gly Arg Asn Thr Asp Ser Lys Glu Tyr He Gly Cys 

675 680 685 

He Glu He Glu Thr He Tyr Pro Asn He Asn Gin Arg Gly Asn Ala 

690 695 700 

He Gly Arg Val Val Val Arg Glu He Arg Asp Gly Val Thr He Asp 
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705 710 715 720 

Tyr Glu Met Phe Glu Val Leu Tyr Leu Thr Val He Val Arg Asp Leu . 

725 730 735 

Asn Thr Val He Gly Glu Asp His Asp He Ser Thr Phe Thr He Thr 

740 745 750 

He He Asp Met Asn Asp Asn Pro Pro Leu Trp Val Glu Gly Thr Leu 

755 760 765 

Thr Gin Glu Phe Arg Val Arg Glu Val Ala Ala Ser Gly Val Val He 

770 775 780 

Gly Ser Val Leu Ala Thr Asp He Asp Gly Pro Leu Tyr Asn Gin Val 
785 790 795 800 

Arg Tyr Thr He Thr Pro Arg Leu Asp Thr Pro Glu Asp Leu Val Asp 

805 810 815 

He Asp Phe Asn Thr Gly Gin He Ser Val Lys Leu His Gin Ala He 

820 825 830 

Asp Ala Asp Glu Pro Pro Arg Gin Asn Leu Tyr Tyr Thr Val He Ala 

835 840 845 

Ser Asp Lys Cys Asp Leu Leu Thr Val Thr Glu Cys Pro Pro Asp Pro 

850 855 860 

Thr Tyr Phe Glu Thr Pro Gly Glu He Thr He His He Thr Asp Thr 
865 870 875 880 

Asn Asn Lys Val Pro Gin Val Glu Asp Asp Lys Phe Glu Ala.Thr Val 

885 890 895 

Tyr He Tyr Glu Gly Ala Asp Asp Gly Gin His Val Val Gin He Tyr 

900 905 910 

Ala Ser Asp Leu Asp Arg Asp Glu He Tyr His Lys Val Ser Tyr Gin 

915 920 ~ 925 

He Asn Tyr Ala He Asn Ser Arg Leu Arg Asp Phe Phe Glu Met Asp 

930 935 940 

Leu Glu Ser Gly Leu Val Tyr Val Asn Asn Thr Ala Gly Glu Leu Leu 
945 950 955 960 

Asp Arg Asp Gly Asp Glu Pro Thr His Arg He Phe Phe Asn Val He 

965 970 975 

Asp Asn Phe Tyr Gly Glu Gly Asp Gly Asn Arg Asn Gin Asn Glu Thr 

980 985 990 

Gin Val Leu Val Val Leu Leu Asp He Asn Asp Asn Tyr Pro Glu Leu 

995 1000 - 1005 

Pro Glu Thr He Pro Trp Ala He Ser Glu Ser Leu Glu Leu Gly Glu 

1010 1015 1020 

Arg Val Gin Pro Glu He Phe Ala Arg Asp Arg Asp Glu Pro Gly Thr 
1025 1030 1035 1040 

Asp Asn Ser Arg Val Ala Tyr Ala He Thr Gly Leu Ala Ser Thr Asp 

1045 1050 1055 

Arg Asp He Gin Val Pro Asn Leu Phe Asn Met He Thr lie Glu Arg 

1060 1065 1070 

Asp Arg Gly He Asp Gin Thr Gly He Leu Glu Ala Ala Met Asp Leu 

1075 1080 1085 

Arg Gly Tyr Trp Gly Thr Tyr Gin He Asp He Gin Ala Tyr Asp His 

1090 1095 1100 

Gly He Pro Gin Arg He Ser Asn Gin Lys Tyr Pro Leu Val He Arg 
1105 1110 1H5 1120 

Pro Tyr Asn Phe His Asp Pro Val Phe Val Phe Pro Gin Pro Gly Ser 

1125 1130 H35 

Thr He Arg Leu Ala Lys Glu Arg Ala Val Val Asn Gly He Leu Ala 

1140 1145 1150 

Thr Val Asp Gly Glu Phe Leu Asp Arg He Val Ala Thr Asp Glu Asp 

1155 1160 ~ H65 

Gly Leu Glu Ala Gly Leu Val Thr Phe Ser He Ala Gly Asp Asp Glu 

1170 1175 1180 

Asp Ala Gin Phe Phe Asp Val Leu Asn Asp Gly Val Asn Ser Gly Ala 
1185 1190 1195 1200 

Leu Thr Leu Thr Arg Leu Phe Pro Glu Glu Phe Arg Glu Phe Gin Val 
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1205 1210 1215 

Thr lie Arg Ala Thr Asp Gly Gly Thr Glu Pro Gly Pro Arg Ser Thr - 

1220 1225 1230 

Asp Cys Leu Val Thr Val Val Phe Val Pro Thr Gin Gly Glu Pro Val 

1235 1240 1245 

Phe Glu Asp Arg Thr Tyr Thr Val Ala Phe Val Glu Lys Asp Glu Gly 

1250 1255 1260 

Met Leu Glu Glu Ala Glu Leu Pro Arg Ala Ser Asp Pro Arg Asn lie 
1265 1270 1275 1280 

Met Cys Glu Asp Asp Cys His Asp Thr Tyr Tyr Ser He Val Gly Gly 

1285 1290 1295 

Asn Ser Gly Glu His Phe Thr Val Asp Pro Arg Thr Asn Val Leu Ser 

1300 1305 1310 

Leu Val Lys Pro Leu Asp Arg Ser Glu Gin Glu Thr His Thr Leu lie 

1315 1320 1325 

He Gly Ala Ser Asp Thr Pro Asn Pro Ala Ala Val Leu Gin Ala Ser 

1330 1335 1340 

Thr Leu Thr Val Thr Val Asn Val Arg Glu Ala Asn Pro Arg Pro Val 
1345 1350 1355 1360 

Phe Gin Arg Ala Leu Tyr Thr Ala Gly He Ser Ala Gly Asp Phe He 

1365 1370 1375 

Glu Arg Asn Leu Leu Thr Leu Val Ala Thr His Ser Glu ASp-Leu Pro 

1380 1385 1390 

He Thr Tyr Thr Leu He Gin Glu Ser Met Glu Ala Asp Pro Thr Leu 

1395 1400 1405 

Glu Ala Val Gin Glu Ser Ala Phe He Leu Asn Pro Glu Thr Gly Val 

1410 1415 1420 

Leu Ser Leu Asn Phe Gin Pro Thr Ala Ser Met His Gly Met Phe Glu 
1425 1430 1435 1440 

Phe Glu Val Lys Ala Thr Asp Ser Arg Thr Glu Thr Ala Arg Thr Glu 

1445 1450 1455 

Val Lys. Val Tyr Leu He Ser Asp Arg Asn Arg Val Phe Phe Thr Phe 

1460 1465 1470 

Asn Asn Pro Leu Pro Glu Val Thr Pro Gin Glu Asp Phe He Ala Glu 

1475 1480 1485 

Thr Phe Thr Ala Phe Phe Gly Met Thr Cys Asn He Asp Gin Ser Trp 

1490 1495 1500 

Trp Ala Ser Asp Pro Val Thr Gly Ala Thr Lys Asp Asp Gin Thr Glu 
1505 1510 1515 1520 

Val Arg Ala His Phe He Arg Asp Asp. Leu Pro Val Pro Ala Glu Glu 

1525 1530 1535 

He Glu Gin Leu Arg Gly Asn Pro Thr Leu Val Asn Ser He Gin Arg 

1540 1545 1550 

Ala Leu Glu Glu Gin Asn Leu Gin Leu Ala Asp Leu Phe Thr Gly Glu 

1555 1560 1565 

Thr Pro He Leu Gly Gly Asp Ala Gin Ala Arg Ala Leu Tyr Ala Leu 

1570 1575 1580 

Ala Ala Val Ala Ala Ala Leu Ala Leu He Val Val Val Leu Leu He 
1585 1590 1595 1600 

Val Phe Phe Val Arg Thr Arg Thr Leu Asn Arg Arg Leu Gin Ala Leu 

1605 1610 1615 

Ser Met Thr Lys Tyr Ser Ser Gin Asp Ser Gly Leu Asn Arg Val Gly 

1620 1625 1630 

Leu Ala Ala Pro Gly Thr Ash Lys His Ala Val Glu Gly Ser Ash Pro 

1635 1640 1645 

He Trp Asn Glu Thr Leu Lys Ala Pro Asp Phe Asp Ala Leu Ser Glu 

1650 1655 ~ 1660 

Gin Ser Tyr Asp Ser Asp Leu He Gly He Glu Asp Leu Pro Gin Phe 
1665 1670 1675 1680 

Arg Asn Asp Tyr Phe Pro Pro Glu Glu Gly Ser Ser Met Arg Gly Val 

1685 1690 1695 

Val Asn Glu His Val Pro Glu Ser He Ala Ash His Asn Asn Asn Phe 
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1700 1705 1710 

Gly Phe Asn Ser Thr Pro Phe Ser Pro Glu Phe Ala Asn Thr Gin Phe - - ~~ 
1715 1720 1725 

Arg Arg 
1730 

<210> 5 
<211> 5592 
<212> DNA 

<213> Spodoptera frugiperda 

<220> 
<221> CDS 

<222> (162) ... (5363) 
<400> 5 

gacattctgt ggtgaaaaca ttttttattt atttttttct agtggtttgt gggtacagtg 60 
taaacatttt ggaatattgt taaagatttc ggaatattgt taaagtattg acagataaag ~ 120 
ctgtaacatc actagagaag tgagaactgc aagatcatga g atg gcg gtc gat gtg 176 

Met Ala Val Asp Val 
1 5 

cga ata ctg aca gca aca ttg ctg gta etc acc act get aca gca cag 224 
Arg He Leu Thr Ala Thr Leu Leu Val Leu Thr Thr Ala Thr Ala Gin 
10 15 20 

cga gat cga tgt ggc tac atg gta gaa ata ccc aga cca gac agg cct 272 
Arg Asp Arg Cys Gly Tyr Met Val Glu He Pro Arg Pro Asp Arg Pro 
25 30 35 

gac ttc cca cct caa aat ttt gac ggt tta aca tgg get cag cag cca 320 
Asp Phe Pro Pro Gin Asn Phe Asp Gly Leu Thr Trp Ala Gin Gin Pro 
40 45 50 

eta tta cca get gag gat cga gaa gag gtc tgc etc aat gac tat gaa 368 
Leu Leu Pro Ala Glu Asp Arg Glu Glu Val Cys Leu Asn Asp Tyr Glu 
55 60 65 

cct gat ccc tgg age aac aac cat ggt gac cag aga att tac atg gag 416 
Pro Asp Pro Trp Ser Asn Asn His Gly. Asp Gin Arg He Tyr Met Glu 
70 75 80 85 

gag gag ate gaa ggt ccc gta gtc att geg aaa att aac tac caa gga 4 64 

Glu Glu lie Glu Gly Pro Val Val lie Ala Lys He Asn Tyr Gin Gly 
90 95 100 

aac acc cct cct caa ata aga tta cct ttt cgt gtt ggt gca gee cac 512 
Asn Thr Pro Pro Gin He Arg Leu Pro Phe Arg Val Gly Ala Ala His 
105 110 115 

atg ctt giga gca gaa att cgt gaa tat cct gac gca act gga gac tgg 560 
Met Leu Gly Ala Glu He Arg Glu Tyr Pro Asp Ala Thr Gly Asp Trp 
120 125 130 

tat ctt gta att act caa agg cag gac tat gaa act' Lcct ^gat.^at^ cag 608 
Tyr Leu Val He Thr Gin Arg Gin Asp Tyr Glu Thr Pro Asp Met Gin*' 
135 140 145 

aga tac acg ttc gat gtg agt gtg gaa ggc cag teg ctg gtt gta acg 656 
Arg Tyr Thr Phe Asp Val Ser Val Glu Gly Gin Ser Leu Val Val Thr 
150 »; . 155 160 . 165 
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gtg agg ctg gat att gtg aac ate gac gac aat gcg ccc ate att gag 704 
Val Arg Leu Asp lie Val Asn lie Asp Asp Asn Ala Pro He He Glu - - 
170 175 180 

atg tta gag cct tgc aac tta ccg gaa ctt gtt gaa ccc cat gtt aca 752 
Met Leu Glu Pro Cys Asn Leu Pro Glu Leu Val Glu Pro His Val Thr 
185 190 195 

gaa tgt aaa tat ate gtg tec gac gca gac ggt ctg ate agt aca agt 800 
Glu Cys Lys Tyr lie Val. Ser Asp Ala Asp Gly Leu He Ser Thr Ser 
200 205 210 

gtt atg agt tat cat ata gac age gag aga gga gac gaa aaa gta ttc 848 
Val Met Ser Tyr His He Asp Ser Glu Arg Gly Asp Glu Lys Val Phe 
215 220 225 

gaa ctg ate aga aaa gat tat ccg ggc gat tgg acg aag gtg tat atg 896 
Glu Leu He Arg Lys Asp Tyr Pro Gly Asp Trp Thr Lys Val Tyr Met 
230 235 240 245 

gtt ctt gaa ttg aaa aaa tct ctt gat tac gaa gag aat cct eta cac 944 
Val Leu Glu Leu Lys Lys Ser Leu Asp Tyr Glu Glu Asn Pro^Leu His 
250 255 260 

ata ttc aga gtc acg get tct gat tec tta cca aac aat agg acc gtg 992 
He Phe Arg Val Thr Ala Ser Asp Ser Leu Pro Asn Asn Arg Thr Val 
265 270 275 

gtc atg atg gtt gaa gta gag aac gtg gaa cat aga aat cct egg tgg 1040 
Val Met Met Val Glu Val Glu Asn Val Glu His Arg Asn Pro Arg Trp 
280 285 290 

atg gag ate ttt get gtg caa cag ttt gat gaa aaa cag gcg aaa teg 1088 
Met Glu He Phe Ala Val Gin Gin Phe Asp Glu Lys Gin Ala Lys Ser 
295 300 305 

ttc aca gtg cga get att gat ggc gac acg gga ate aat aaa cct ata 1136 
Phe Thr Val Arg Ala He Asp Gly Asp Thr Gly He Asn Lys Pro He 
310 . 315 320 325 

ttc tat cgt ata gaa act gaa gat gaa gac aaa gag ttc ttc age att 1184 
Phe Tyr Arg He Glu Thr Glu Asp Glu Asp Lys Glu Phe Phe Ser He 
330 335 340 

gag aac ata ggg gaa ggc aga gac ggt gec aga ttc cac gtg get cct 1232 
Glu Asn He Gly Glu Gly Arg Asp Gly Ala Arg Phe His Val Ala Pro 
345 350 355 

ata gac aga gac tac ctg aaa agg gat atg ttt cat ata aga ata att 1280 
He Asp Arg Asp Tyr Leu Lys Arg Asp Met Phe His He Arg He lie 
360 365 370 

gca tat aaa caa ggt gat aat gac aaa gaa ggt gaa tea teg ttc gag 1328 
Ala Tyr Lys Gin Gly Asp Asn Asp Lys Glu Gly Glu Ser Ser Phe Glu 
375 380 385 

acc tea gca aat gtg acg att ata att aac gat ata aat gat cag agg 137 6 

Thr Ser Ala Asn Val Thr He lie He Asn Asp He Asn Asp Gin Arg 
390 395 400 405 

cca gaa ccc ttc cat aaa gaa tac acg ate tec ata atg gaa gaa act 1424 
Pro Glu Pro Phe His Lys Glu Tyr Thr He Ser lie Met Glu Glu Thr 
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410 415 420 

gcg atg acc tta gat ttg caa gag ttt ggt ttc cat gac cgt gac att 14 72 

Ala Met Thr Leu Asp Leu Gin Glu Phe Gly Phe His Asp Arg Asp lie 
425 430 435 

ggt ccc cac get cag tac gac gtt cac tta gag agt ata cag cca gag 1520 
Gly Pro His Ala Gin Tyr Asp Val His Leu Glu Ser He Gin Pro Glu 
440 445 450 

ggg gec cat acc get ttc tac ate gee cct gaa gaa ggt tac cag gec 1568 
Gly Ala His Thr Ala Phe Tyr He Ala Pro Glu Glu Gly Tyr Gin Ala 
455 460 465 

cag tct ttc acc ata ggt act aga ate cat aac atg ttg gat tat gaa 1616 
Gin Ser Phe Thr lie Gly Thr Arg lie His Asn Met Leu Asp Tyr Glu 
470 475 480 485 

gat gac gac tac aga cca gga ata aag eta aag gca gta gca att gac 1664 
Asp Asp Asp Tyr Arg Pro Gly He Lys Leu Lys Ala Val Ala He Asp 
4 90 4 95 500 

aga cac gat aac aat cac att ggg gaa gca att att aac att aac ctt 1712 
Arg His Asp Asn Asn His He Gly Glu Ala He He Asn He Asn Leu 
505 510 515 

ate aat tgg aat gat gag eta cct ata ttc gac gag gac gee tac aac 1760 
He Asn Trp Asn Asp Glu Leu Pro He Phe Asp Glu Asp Ala Tyr Asn 
520 525 530 

gtg aca ttt gag gag acg gtc ggt gat ggc ttc cac att ggt aaa tac 1808 
Val Thr Phe Glu Glu Thr Val Gly Asp Gly Phe His He Gly Lys Tyr 
535 540 545 

egg get aaa gac aga gac ate ggt gac ata gtc gag cac teg ata ttg 1856 
Arg Ala Lys Asp Arg Asp He Gly Asp He Val Glu His Ser lie Leu 
550 555 560 565 

ggc aac get gca aac ttc ctg aga att gac ata gat act gga gat gtg 1904 
Gly Asn Ala Ala Asn Phe Leu Arg He Asp He Asp Thr Gly Asp Val 
570 575 580 

tac gtg tea egg gac gat tac ttt gat tat caa aga cag aac gaa ate 1952 
Tyr Val Ser Arg Asp Asp Tyr Phe Asp Tyr Gin Arg Gin Asn Glu He 
585 590 595 

ata gtt cag att ctg get gtt gat aca eta ggt tta cct cag aac agg 2000 
He Val Gin lie Leu Ala Val Asp Thr Leu Gly Leu Pro Gin Asn Arg 
600 1 605 610 

get acc aca cag etc acg ata ttt ttg gaa gac ate aac aac acg cca 2048 
Ala Thr Thr Gin Leu Thr He Phe Leu Glu Asp He Asn Asn Thr Pro 
615 .620 625 

cct ata ctg cga ctg cca cgt tec agt cca agt gta gaa gag aac gtt 2096 
Pro He Leu Arg Leu Pro Arg Ser Ser Pro Ser Val Glu Glu Asn Val 
630 635 640 645 

gaa gtc ggg cac ccg att acc gag ggg eta acg gcg aca gac cca gac 2144 
Glu Val Gly His Pro He Thr Glu Gly Leu Thr Ala Thr Asp Pro Asp 
650 655 660 
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acc aca gcc gat tta cac ttc gag ate gat tgg gac aat tct tac get 2192 
Thr Thr Ala Asp Leu His Phe Glu lie Asp Trp Asp Asn Ser Tyr Ala- - 
665 -670 675 

acg aag cag ggc acc aat gga ccc aac act gca gac tac cac gga tgc 224 0 

Thr Lys Gin Gly Thr Asn Gly Pro Asn Thr Ala Asp Tyr His Gly Cys 
680 685 690 

gta gaa ate ctg acg gta tac cca gat cct gac aat cac ggg aga get 2288 
Val Glu He Leu Thr Val Tyr Pro Asp Pro Asp Asn His Gly Arg Ala 
695 700 705 

gag ggt cac ttg gtg gca cgt gag gtc agt gat ggc gtg acc ate gat 2336 
Glu Gly His Leu Val Ala Arg Glu Val Ser Asp Gly Val Thr lie Asp 
710 715 720 725 

tac gag aag ttt gag gtg ctg tac etc gtc gtc agg gtg ata gat cgc 2384 
Tyr Glu Lys Phe Glu Val Leu Tyr Leu Val Val Arg Val He Asp Arg 
730 735 740 

aac act gtc att ggc cct gat tat gac gaa gca atg ctg acg gtg acg 24 32 

Asn Thr Val He Gly Pro Asp Tyr Asp Glu Ala Met Leu Thjr-Val Thr 
745 750 755 

ata ate gat atg aac gac aac tgg ccg ata tgg gcc gac aac acg ctg 2480 
He He Asp Met Asn Asp Asn Trp Pro He Trp Ala Asp Asn Thr Leu 
760 765 770 

cag cag aca ctg cgc gtg cgc gag atg gcc gac gaa gga gtc ate gtc 2528 
Gin Gin Thr Leu Arg Val Arg Glu Met Ala Asp Glu Gly Val He Val 
775 780 785 

ggt aca ctg etc gcc acc gac ttg gat ggc cct etc tac aac cga gtc 2576 
Gly Thr Leu Leu Ala Thr Asp Leu Asp Gly Pro Leu Tyr Asn Arg Val 
790 795 800 805 

cgc tac acc atg gtc ccc ate aag gac act cct gat gac eta ata gcg 2624 
Arg Tyr Thr Met Val Profile Lys Asp Thr Pro Asp Asp Leu He Ala 
810 . 815 820 

ate aac; tap gtc acc ggt cag ctg act gtg aac aag ggg caa gca att 2672 
He Asn Tyr Val Thr Gly Gin Leu Thr Val Asn Lys Gly Gin Ala He 
825 830 835 

gac gca gat gat cca cct cgc ttc tac ctg tat tac aag gtc act gcc 2720 
Asp Ala Asp Asp Pro Pro Arg Phe Tyr Leu Tyr Tyr Lys Val Thr Ala 
840 845 850 

age gat aag tgc tct ctt gac gag ttc ttc cct gtg tgc cca cct gac 2768 
Ser Asp Lys Cys Ser Leu Asp Glu Phe Phe Pro Val Cys Pro Pro Asp 
855 860 865 

ccc act tac tgg aat acc gag gga gag ata gcg ate gcg ata acc gat 2816 
Pro Thr Tyr Trp Asn Thr Glu Gly Glu He Ala He Ala He Thr Asp 
870 875 880 885 

acg aac aac aaa att cca cgc gcg gaa aca gat atg ttc cct agt gaa 2864 
Thr Asn Asn Lys He Pro Arg Ala Glu Thr Asp Met Phe Pro Ser Glu 
890 895 900 

aag cgc ate tat gag aac aca ccc aat ggt acc aag ate acg acg ate 2912 
Lys Arg He Tyr Glu Asn Thr Pro Asn Gly Thr Lys He Thr Thr lie 
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905 910 915 

ate get agt gac cag gac aga gat cga cca aat aac gcg ctg acg tac 2960 
He Ala Ser Asp Gin Asp Arg Asp Arg Pro Asn Asn Ala Leu Thr Tyr 
920 925 930 

aga ate aac tac gca ttc aac cac agg ctg gag aac ttc ttc gca gtg 3008 
Arg He Asn Tyr Ala Phe Asn His Arg Leu Glu Asn Phe Phe Ala Val 
935 940 945 

gac cct gat act ggt gaa ctg ttt gtc cac ttc acc act age gaa gtg 3056 
Asp Pro Asp Thr Gly Glu Leu Phe Val His Phe Thr Thr Ser Glu Val 
950 955 960 965 

ttg gac aga gac gga gag gaa ccg gag cat agg ate ate ttc acc ate 3104 
Leu Asp Arg Asp Gly Glu Glu Pro Glu His Arg He lie Phe Thr He 
970 975 980 

gtc gat aac ttg gaa ggc get gga gat ggc aat cag aac aca ate tec 3152 
Val Asp Asn Leu Glu Gly Ala Gly Asp Gly Asn Gin Asn Thr He Ser 
985 990 995 

acg gag gtg cgt gtt ata ctg ctt gat ata aac gac aat aag ccg gaa 3200 
Thr Glu Val Arg Val He Leu Leu Asp He Asn Asp Asn Lys Pro Glu 
1000 1005 1010 

eta cca att cct gat ggc gaa ttt tgg acc gtt tec gaa ggt gaa gtg 324 8 

Leu Pro He Pro Asp Gly Glu Phe Trp Thr Val Ser Glu Gly Glu Val 
1015 1020 1025 

gag gga aaa cgc att cca cca gag att cac gca cac gac aga gat gaa 3296 
Glu Gly. Lys Arg He Pro Pro Glu He His Ala His Asp Arg Asp Glu 
1030 1035 1040 1045 

cca ttc aac gac aac tct cgc gtg gga tat gaa att cga teg ate aaa 3344 
Pro Phe Asn Asp Asn Ser Arg Val Gly Tyr Glu He Arg Ser lie Lys 
1050 1055 1060 

ttg ate aat aga gac ate gag ctt cct caa gat cca ttc aaa ata ata 3392 
Leu He Asn Arg Asp He Glu Leu Pro . Gin Asp Pro Phe Lys He He 
1065 1070 1075 

acg att gat gat etc gat acc tgg aaa ttc gtt gga gag ttg gag act 3440 
Thr He Asp Asp Leu Asp Thr Trp Lys Phe Val Gly Glu Leu Glu Thr 
1080 1085 1090 

acc atg gac ctt aga gga tac tgg gga acc tat gat gtc gag ata cgt 3488 
Thr Met Asp Leu Arg Gly Tyr Trp Gly Thr Tyr Asp Val Glu He Arg 
1095 1100 1105 

gcg ttt gac cac ggt ttc ccg atg ctg gat tea ttc gag acc tac caa 3536 
Ala Phe Asp His Gly Phe Pro Met Leu Asp Ser Phe Glu Thr Tyr Gin 
1110 1115 1120 1125 

eta acc gtc agg cca tac aac ttc cat tea ccg gtg ttt gtg ttc cca 3584 
Leu Thr Val Arg Pro Tyr Asn Phe His Ser Pro Val Phe Val Phe Pro 
1130 1135 1140 

act cct ggc tea acc ate agg ctt tct agg gag cgt get ata gtc aat 3632 
Thr Pro Gly Ser Thr He Arg Leu Ser Arg Glu Arg Ala He Val Asn 
1145 1150 1155 
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ggt atg ctg get ctg get aat ate gcg age gga gag ttc etc gac aga 
Gly Met Leu Ala Leu Ala Asn lie Ala Ser Gly Glu Phe Leu Asp Arg 
1160 1165 1170 



3680 



etc tct gec act gat gaa gat ggg eta cac gca ggc aga gta act ttc 
Leu Ser Ala Thr Asp Glu Asp Gly Leu His Ala Gly Arg Val Thr Phe 
1175 1180 1185 



3728 



tec ata get gga aac gat gaa get gcg gaa tat ttc aat gtg ttg aac 
Ser lie Ala Gly Asn Asp Glu Ala Ala Glu Tyr Phe Asn Val Leu Asn 
1190 1195 1200 1205 



3776 



gac ggt gac aac tea gca atg etc acg ctg aag caa gca ttg ccc get 
Asp Gly Asp Asn Ser Ala Met Leu Thr Leu Lys Gin Ala Leu Pro Ala 
1210 1215 1220 



3824 



ggc gtc cag cag ttt gag ttg gtt att egg gee acg gac ggc ggg acg 
Gly Val Gin Gin Phe Glu Leu Val lie Arg Ala Thr Asp Gly Gly Thr 
1225 1230 1235 



3872 



gag ccg gga cct agg agt acc gac tgc tec gtc act gtg gtg ttt gtg 
Glu Pro Gly Pro Arg Ser Thr Asp Cys Ser Val Thr Val Val ^>he Val 
1240 1245 1250 



3920 



atg acg cag gga gac ccc gtg ttc gac gac aac gca get tct gtc cgc 
Met Thr Gin Gly Asp Pro Val Phe Asp Asp Asn Ala Ala Ser Val Arg 
1255 1260 1265 



3968 



ttc gtt gaa aag gaa get ggt atg teg gaa aag ttt cag ctg cct cag 
Phe Val Glu Lys Glu Ala Gly Met Ser Glu Lys Phe Gin Leu Pro Gin 
1270 1275 1280 1285 



4016 



gec gat gac ccc aaa aac tac agg tgt atg gac gac tgc cat acc ate 
Ala Asp Asp Pro Lys Asn Tyr Arg Cys Met Asp Asp Cys His Thr lie 
1290 1295 " 1300 



4064 



tac tac tct ate gtt gat ggc aac gat ggt gac cac ttc gee gtg gag 
Tyr Tyr Ser lie Val Asp Gly Asn Asp Gly Asp His Phe Ala Val Glu 
1305 1310 1315 



4112 



ccg gag act aac gtg ate tat ttg ctg aag ccg ctg gac cgc age caa 
Pro Glu Thr Asn Val lie Tyr Leu Leu Lys Pro Leu Asp Arg Ser Gin 
1320 1325 >- 1330 



4160 



cag gag cag tac agg gtc gtg gtg gcg get tec aac acg cct ggc ggc 
Gin Glu Gin Tyr Arg Val Val Val Ala Ala Ser Asn Thr Pro Gly Gly 
1335 i 1340 1345 



4208 



acc tec acc ttg tec tec tea etc etc acc gtc acc ate ggc gtt cga 
Thr Ser Thr Leu Ser Ser Ser Leu Leu Thr Val Thr lie Gly Val Arg 
1350 1355 1360 1365 



4256 



gaa gca aac cct aga ccg ate ttc gaa agt gaa ttt tac aca get ggc 
Glu Ala Asn Pro Arg Pro lie Phe Glu Ser Glu Phe Tyr Thr Ala Gly 
1370 1375 1380 



4304 



gtc tta cac acc gat age ata cac aag gag etc gtt tac ctg gcg gca 
Val Leu His Thr Asp Ser lie His Lys Glu Leu Val Tyr Leu Ala Ala 
1385 1390 1395 



4352 



aaa cat tea gaa ggg ctt cct ate gtc tac teg ata gat caa gaa acc 
Lys His Ser Glu Gly Leu Pro lie Val Tyr Ser lie Asp Gin Glu Thr 
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1400 1405 1410 

atg aaa ata gac gag teg ttg caa aca gtt gtg gag gac gec ttc gac 
Met Lys lie Asp Glu Ser Leu Gin Thr Val Val Glu Asp Ala Phe Asp 
1415 1420 1425 

att aac tct gca acc gga gtc ata teg ctg aac ttc cag cca aca tct 
lie Asn Ser Ala Thr Gly Val lie Ser Leu Asn Phe Gin Pro Thr Ser 
1430 1435 1440 1445 

gtc atg cac ggc agt ttc gac ttc gag gtg gtg get agt gac acg cgt 
Val Met His Gly Ser Phe Asp Phe Glu Val Val Ala Ser Asp Thr Arg 
1450 1455 1460 

gga gcg agt gat cga gca aaa gtg tea att tac atg ata teg act cgc 
Gly Ala Ser Asp Arg Ala Lys Val Ser lie Tyr Met lie Ser Thr Arg 
1465 1470 1475 

gtt aga gta gee ttc ctg ttc tac aac acg gaa get gaa gtt aac gag 
Val Arg Val Ala Phe Leu Phe Tyr Asn Thr Glu Ala Glu Val Asn Glu 
1480 1485 1490 



aga aga aat ttc att gca caa acg ttc gee aac gcg ttt ggt atg aca 
Arg Arg Asn Phe lie Ala Gin Thr Phe Ala Asn Ala Phe Gly Met Thr 
1495 1500 1505 

tgt aac ata gac age gtg ctg ccg get acc gac gee aac ggc gtg att 
Cys Asn lie Asp Ser Val Leu Pro Ala Thr Asp Ala Asn Gly Val He 
1510 1515 1520 1525 

cgc gag ggg tac aca gaa etc cag get cac ttc ata cga gac gac cag 
Arg Glu Gly Tyr Thr Glu Leu Gin Ala His Phe He Arg Asp Asp Glh 
1530 1535 1540 

ccg gtg cca gee gac tat att gag gga tta ttt acg gaa etc aat aca 
Pro Val Pro Ala Asp Tyr lie Glu Gly Leu Phe Thr Glu Leu Asn Thr 
1545 1550 1555 

ttg cgt gac ate aga gag gta ctg agt act cag caa ttg acg eta ctg 
Leu Arg Asp He Arg Glu Val Leu Ser Thr Gin Gin Leu Thr Leu Leu 
1560 1565 1570 

gac ttt gcg gcg gga ggg teg gca gtg ctg ccc ggc gga gag tac gcg 
Asp Phe Ala Ala Gly Gly Ser Ala Val Leu Pro Gly Gly Glu Tyr Ala 
1575 1580 1585 

eta gcg gtg tac ate etc gee ggc ate gca gcg tta etc gec gtc ate 
Leu Ala Val Tyr He Leu Ala Gly lie Ala Ala Leu Leu Ala Val He 
1590 1595 1600 1605 

tgt etc get etc etc ate get ttc ttc att agg aac cga aca ctg aac 
Cys Leu Ala Leu Leu He Ala Phe Phe He Arg Asn Arg Thr Leu Asn 
1610 1615 1620 

egg cgc ate gaa gee etc aca ate aaa gat gtt cct acg gac ate gag 
Arg Arg He Glu Ala Leu Thr He Lys Asp Val Pro Thr Asp He Glu 
1625 1630 1635 

cca aac cac gcg tea gta gca gtg eta aac att aac aag cac aca gaa 
Pro Asn His Ala Ser Val Ala Val Leu Asn He Asn Lys His Thr Glu 
1640 1645 1650 



4448 



4496 



4544 



4592 



4640 



4 688 



4736 



4784 



4832 



4880 



4928 



4976 



5024 



5072 



5120 
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cct ggt tec aat ccc ttc tat aac ccg gat gtt aag aca cct aac ttc 5168 
Pro Gly Ser Asn Pro Phe Tyr Asn Pro Asp Val Lys Thr Pro Asn Phe- - — ~- 
1655 1660 1665 

gac act ata age gaa gta tec gat gac ctg ctt gat gtc gaa gac ttg 5216 
Asp Thr lie Ser Glu Val Ser Asp Asp Leu Leu Asp Val Glu Asp Leu 
1670 1675 1680 1685 

gaa cag ttt gga aag gat tac ttc cca ccc gaa aac gaa att gag age 5264 
Glu Gin Phe Gly Lys Asp Tyr Phe Pro Pro Glu Asn Glu lie Glu Ser 
1690 1695 1700 

ctg aat ttt gca cgt aac ccc ata gcg aca cac ggg aac aac ttt ggc 5312 
Leu Asn Phe Ala Arg Asn Pro lie Ala Thr His Gly Asn Asn Phe Gly 
1705 1710 1715 

gta aac tea age ccc tec aac cca gag ttc tec aac tec cag ttt aga 5360 
Val Asn Ser Ser Pro Ser Asn Pro Glu Phe Ser Asn Ser Gin Phe Arg 
1720 1725 1730 

agt taaactaaat acacttttat cacttgeata gacttatgta tttaataatt 5413 

Ser --. * 



ttacattttt tacattaaat ataaatgttt tatatgtaat aatagtgtga taaaatgtac 5473 
gtaacaatca acatagctgt tgtaggttcg taaataacat actegtaatg tataagtgtt 5533 
atgtttatat atagaaataa aaatattaaa tattaaaaaa aaaaaaaaaa aaaaaaaaa 5592 

<210> 6 
<211> 1734 
<212> PRT 

<213> Spodoptera frugiperda 
<400> 6 

Met Ala Val Asp Val Arg He Leu Thr Ala Thr Leu Leu Val Leu Thr 

1 5 10 15 

Thr Ala Thr Ala Gin Arg Asp Arg Cys Gly Tyr Met Val Glu He Pro 

20 25 30 

Arg Pro Asp Arg Pro Asp Phe Pro Pro Gin Asn Phe Asp Gly Leu Thr 

35 40 45 

Trp Ala Gin Gin Pro Leu Leu Pro Ala Glu Asp Arg Glu Glu Val Cys 

50 55 60 

Leu Asn Asp Tyr Glu Pro Asp Pro Trp Ser Asn Asn His Gly Asp Gin 
65 70 75 80 

Arg He Tyr Met Glu Glu Glu He Glu Gly Pro Val Val He Ala Lys 

85 90 95 

He Asn Tyr Gin Gly Asn Thr Pro Pro Gin He Arg Leu Pro Phe Arg 

100 105 110 

Val Gly Ala Ala His Met Leu Gly Ala Glu He Arg Glu Tyr Pro Asp 

115 120 125 

Ala Thr Gly Asp Trp Tyr Leu Val He Thr Gin Arg Gin Asp Tyr Glu 

130 135 140 

Thr Pro Asp Met Gin Arg Tyr Thr Phe Asp Val Ser Val Glu Gly Gin 
145 150 155 160 

Ser Leu Val Val Thr Val Arg Leu Asp He Val Asn lie Asp Asp Asn 

165 170 175 

Ala Pro He He Glu Met Leu Glu Pro Cys Asn Leu Pro Glu Leu Val 

180 185 190 

Glu Pro His Val Thr Glu Cys Lys Tyr lie Val Ser Asp Ala Asp Gly 

195 200 205 

Leu He Ser Thr Ser Val Met Ser Tyr His He Asp Ser Glu Arg Gly 
210 215 220 
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Asp Glu Lys Val Phe Glu Leu lie Arg Lys Asp Tyr Pro Gly Asp Trp 
225 230 235 240 

Thr Lys Val Tyr Met Val Leu Glu Leu Lys Lys Ser Leu Asp Tyr Glu 

245 250 255 

Glu Asn Pro Leu His lie Phe Arg Val Thr Ala Ser Asp Ser Leu Pro 

260 265 270 

Asn Asn Arg Thr Val Val Met Met Val Glu Val Glu Asn Val Glu His 

275 280 285 

Arg Asn Pro Arg Trp Met Glu lie Phe Ala Val Gin Gin Phe Asp Glu 

290 295 300 

Lys Gin Ala Lys Ser Phe Thr Val Arg Ala lie Asp Gly Asp Thr Gly 
305 310 315 320 

lie Asn Lys Pro lie Phe Tyr Arg lie Glu Thr Glu Asp Glu Asp Lys 

325 330 335 

Glu Phe Phe Ser lie Glu Asn lie Gly Glu Gly Arg Asp Gly Ala Arg 

340 345 350 

Phe His Val Ala Pro lie Asp Arg Asp Tyr Leu Lys Arg Asp Met Phe 

355 360 365 

His lie Arg lie lie Ala Tyr Lys Gin Gly Asp Asn Asp Lys Glu Gly 

370 375 380 

Glu Ser Ser Phe Glu Thr Ser Ala Asn Val Thr lie lie lie Asn Asp 
385 390 395 - 400 

lie Asn Asp Gin Arg Pro Glu Pro Phe His Lys Glu Tyr Thr lie Ser 

405 410 415 

lie Met Glu Glu Thr Ala Met Thr Leu Asp Leu Gin Glu Phe Gly Phe 

420 425 430 

His Asp Arg Asp lie Gly Pro His Ala Gin Tyr Asp Val His Leu Glu 

435 440 445 

Ser lie Gin Pro Glu Gly Ala His Thr Ala Phe Tyr lie Ala Pro Glu 

450 455 460 

Glu Gly Tyr Gin Ala Gin Ser Phe Thr lie Gly Thr Arg He His Asn 
465 470 475 480 

Met Leu Asp Tyr Glu Asp Asp Asp Tyr Arg Pro Gly lie Lys Leu Lys 

485 4 90 495 

Ala Val Ala He Asp Arg His Asp Asn Asn His He Gly Glu Ala He 

500 505 510 

He Asn lie Asn Leu He Asn Trp Asn Asp Glu Leu Pro He Phe Asp 

515 520 525 

Glu Asp Ala Tyr Asn Val Thr Phe Glu Glu Thr Val Gly Asp Gly Phe 

530 535 540 

His He Gly Lys Tyr Arg Ala Lys Asp Arg Asp He Gly Asp He Val 
545 550 555 560 

Glu His Ser He Leu Gly Asn Ala Ala Asn Phe Leu Arg He Asp He 

565 570 575 

Asp Thr Gly Asp Val Tyr Val Ser Arg Asp Asp Tyr Phe Asp Tyr Glh 

580 585 590 

Arg Gin Asn Glu He He Val Gin lie Leu Ala Val Asp Thr Leu Gly 

595 600 605 

Leu Pro Gin Asn Arg Ala Thr Thr Gin Leu Thr He Phe Leu Glu Asp 

610 615 620 

He Asn Asn Thr Pro Pro He Leu Arg Leu Pro Arg Ser Ser Pro Ser 
625 630 635 640 

Val Glu Glu Asn Val Glu Val Gly His Pro He Thr Glu Gly Leu Thr 

645 650 655 

Ala Thr Asp Pro Asp Thr Thr Ala Asp Leu His Phe Glu He Asp Trp 

660 665 670 

Asp Asn Ser Tyr Ala Thr Lys Gin Gly Thr Asn Gly Pro Asn Thr Ala 

675 680 685 

Asp Tyr His Gly Cys Val Glu He Leu Thr Val Tyr Pro Asp Pro Asp 

690 695 700 

Asn His Gly Arg Ala Glu Gly His Leu Val Ala Arg Glu Val Ser Asp 
705 710 715 720 
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Gly val 


Thr 


He 


Asp 


Tyr 


Glu 


Lys 


Phe Glu 


Val Leu 


Tyr 


Leu 


Val 


Val 






725 








730 








735 


- 


Arg Val 


He 


Asp 


Arg 


Asn 


Thr 


Val 


He Gly 


Pro Asp 


Tyr 


Asp 


Glu 


Ala 




740 










745 






750 






Met Leu 


Thr 


Val 


Thr 


He 


He 


Asp 


Met Asn Asp Ash 


Trp 


Pro 


He 


Trp 




755 










760 






765 








Ala Asp 


Asn 


Thr 


Leu 


Gin 


Gin 


Thr 


Leu Arg 


Val Arg 


Glu 


Met 


Ala 


Asp 


770 










775 






780 










Glu Gly 


Val 


He 


Val 


Gly 


Thr 


Leu 


Leu Ala 


Thr Asp 


Leu 


Asp 


Gly 


Pro 


.785 








790 








795 








800 


Leu Tyr 


Asn 


Arg 


Val 


Arg 


Tyr 


Thr 


Met Val 


Pro He 


Lys 


Asp 


Thr 


Pro 






805 








810 








815 




Asp Asp 


Leu 


He 


Ala 


He 


Asn 


Tyr 


Val Thr 


Gly Gin 


Leu 


Thr 


Val 


Asn 




820 










825 






830 






Lys Gly 


Gin 


Ala 


He 


Asp 


Ala 


Asp 


Asp Pro 


Pro Arg 


Phe 


Tyr 


Leu 


Tyr 


835 










840 






845 








Tyr Lys 


Val 


Thr 


Ala 


Ser 


Asp 


Lys 


Cys Ser 


Leu Asp 


Glu 


Phe 


Phe 


Pro 


850 










855 






860 










Val Cys 


Pro 


Pro 


Asp 


Pro 


Thr 


Tyr 


Trp Asn 


Thr Glu 


Gly 


Glu 


He 


Ala 


865 








870 








875 








880 


lie Ala 


He 


Thr 


Asp 


Thr 


Asn 


Asn 


Lys He 


Pro Arg 


Ala 


Glu 


Thr 


Asp 








885 








890 






-895 




Met Phe 


Pro 


Ser 
900 


Glu 


Lys 


Arg 


He 


Tyr Glu 
905 


Asn Thr 


Pro 


Asn 
910 


Gly 


Thr 


Lys lie 


Thr 


Thr 


He 


He 


Ala 


Ser 


Asp Gin Asp Arg 


Asp 


Arg 


Pro 


Asn 


915 










920 






925 








Asn Ala 


Leu 


Thr 


Tyr 


Arg 


He 


Asn 


Tyr Ala 


Phe Asn 


His 


Arg 


Leu 


Glu 


930 










935 






940 










Asn Phe 


Phe 


Ala 


Val 


Asp 


Pro 


Asp 


Thr Gly 


Glu Leu 


Phe 


Val 


His 


Phe 


945 








950 








955 








960 


Thr Thr 


Ser 


Glu 


Val 


Leu 


Asp 


Arg 


Asp Gly 


Glu Glu 


Pro 


Glu 


His 


Arg 








965 








970 








975 




He He 


Phe 


Thr 


He 


Val 


Asp 


Asn 


Leu Glu Gly Ala 


Gly 


Asp 


Gly 


Asn 






980 










985 






990 






Gin Asn 


Thr 


He 


Ser 


Thr 


Glu 


Val 


Arg Val 


He Leu 


Leu 


Asp 


He 


Asn 




995 










1000 




1005 






Asp Asn 


Lys 


Pro 


Glu 


Leu 


Pro 


He 


Pro Asp Gly Glu 


Phe 


Trp 


Thr 


Val 


1010 








1015 




1020 








Ser Glu Gly 


Glu 


Val 


Glu 


Gly 


Lys 


Arg He 


Pro Pro 


Glu 


He 


His 


Ala 


1025 








1030 






1035 








1040 


His Asp Arg 


Asp 


Glu 


Pro 


Phe Asn Asp Asn 


Ser Arg Val Gly 


Tyr 


Glu 






1045 






1050 






1055 


He Arg 


Ser 


He 


Lys 


Leu 


He 


Asn 


Arg Asp 


He Glu 


Leu 


Pro 


Gin Asp 




1060 








1065 






1070 




Pro Phe 


Lys 


He 


He 


Thr 


He 


Asp 


Asp Leu 


Asp Thr 


Trp 


Lys 


Phe 


Val 



1075 1080 1085 



Gly Glu Leu Glu Thr Thr Met Asp Leu Arg Gly Tyr Trp Gly Thr Tyr 

1090 1095 1100 

Asp Val Glu He Arg Ala Phe Asp His Gly Phe Pro Met Leu Asp Ser 
1105 1110 1115 H20 

Phe Glu Thr Tyr Gin Leu Thr Val Arg Pro Tyr Asn Phe His Ser Pro 

1125 1130 1135 

Val Phe Val Phe Pro Thr Pro Gly Ser Thr He Arg Leu Ser Arg Glu 

1140 1145 1150 

Arg Ala He Val Asn Gly Met Leu Ala Leu Ala Asn He Ala Ser Gly 

1155 H60 1165 

Glu Phe Leu Asp Arg Leu Ser Ala Thr Asp Glu Asp Gly Leu His Ala 

1170 1175 1180 

Gly Arg Val Thr Phe Ser lie Ala Gly Asn Asp Glu Ala Ala Glu Tyr 
1185 1190 1195 1200 

Phe Asn Val Leu Asn Asp Gly Asp Asn Ser Ala Met Leu Thr Leu Lys 
1205 1210 1215 
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Gin Ala Leu Pro Ala Gly Val Gin Gin Phe Glu Leu Val lie Arg Ala 

1220 1225 1230 

Thr Asp Gly Gly Thr Glu Pro Gly Pro Arg Ser Thr Asp Cys Ser Val 

1235 1240 1245 

Thr Val Val Phe Val Met Thr Gin Gly Asp Pro Val Phe Asp Asp Asn 

1250 1255 1260 

Ala Ala Ser Val Arg Phe Val Glu Lys Glu Ala Gly Met Ser Glu Lys 
1265 1270 1275 1280 

Phe Gin Leu Pro Gin Ala Asp Asp Pro Lys Asn Tyr Arg Cys Met Asp 

1285 1290 1295 

Asp Cys His Thr He Tyr Tyr Ser He Val Asp Gly Asn Asp Gly Asp 

1300 1305 1310 

His Phe Ala Val Glu Pro Glu Thr Asn Val He Tyr Leu Leu Lys Pro 

1315 1320 1325 

Leu Asp Arg Ser Gin Gin Glu Gin Tyr Arg Val Val Val Ala Ala Ser 

1330 1335 1340 

Asn Thr Pro Gly Gly Thr Ser Thr Leu Ser Ser Ser Leu Leu Thr Val 
1345 1350 1355 1360 

Thr He Gly Val Arg Glu Ala Asn Pro Arg Pro He Phe Glu Ser Glu 

1365 1370 1375 

Phe Tyr Thr Ala Gly Val Leu His Thr Asp Ser He His Lys Glu Leu 

1380 1385 1390 

Val Tyr Leu Ala Ala Lys His Ser Glu Gly Leu Pro He Val Tyr Ser 

1395 1400 1405 

He Asp Gin Glu Thr Met Lys He Asp Glu Ser Leu Gin Thr Val Val 

1410 1415 1420 

Glu Asp Ala Phe Asp He Asn Ser Ala Thr Gly Val lie Ser Leu Asn 
1425 1430 1435 1440 

Phe Gin Pro Thr Ser Val Met His Gly Ser Phe Asp Phe Glu Val Val 

1445 1450 1455 

Ala Ser Asp Thr Arg Gly Ala Ser Asp Arg Ala Lys Val Ser He Tyr 

1460 1465 1470 

Met He' Ser Thr Arg Val Arg Val Ala Phe Leu Phe Tyr Asn Thr Glu 

1475 1480 1485 

Ala Glu Val Asn Glu Arg Arg Asn Phe He Ala Gin Thr Phe Ala Asn 

1490 1495 1500 

Ala Phe Gly Met Thr Cys Asn He Asp Ser Val Leu Pro Ala Thr Asp 
1505 1510 1515 1520 

Ala Asn Gly Val He Arg Glu Gly Tyr Thr Glu Leu Gin Ala His Phe 

1525 1530 , 1535 

He Arg Asp Asp Gin Pro Val Pro Ala Asp Tyr He Glu Gly Leu Phe 

1540 1545 1550 

Thr Glu Leu Asn Thr Leu Arg Asp He Arg Glu Va.1 Leu Ser Thr Gin 

1555 1560 1565 

Gin Leu Thr Leu Leu Asp Phe Ala Ala Gly Gly Ser Ala Val Leu Pro 

1570 1575 1580 

Gly Gly Glu Tyr Ala Leu Ala Val Tyr He Leu Ala Gly lie Ala Ala 
1585 1590 1595 1600 

Leu Leu Ala Val lie Cys Leu Ala Leu Leu lie Ala Phe Phe He Arg 

1605 1610 1615 

Asn Arg Thr Leu Asn Arg Arg He Glu Ala Leu Thr lie Lys Asp Val 

1620 1625 1630 

Pro Thr Asp lie Glu Pro Asn His Ala Ser Val Ala Val Leu Asn lie 

1635 1640 1645 

Asn Lys His Thr Glu Pro Gly Ser Asn Pro Phe Tyr Asn Pro Asp Val 

1650 1655 1660 

Lys Thr Pro Asn Phe Asp Thr lie Ser Glu Val Ser Asp Asp Leu Leu 
1665 1670 1675 1680 

Asp Val Glu Asp Leu Glu Gin Phe Gly Lys Asp Tyr Phe Pro Pro Glu 

1685 1690 1695 

Asn Glu lie Glu Ser Leu Asn Phe Ala Arg Asn Pro lie Ala Thr His 
1700 1705 1710 
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Gly Asn Asn Phe Gly Val Asn Ser Ser Pro Ser Asn Pro Glu Phe Ser 

1715 1720 1725 . „ 

Asn Ser Gin Phe Arg Ser 
1730 

<210> 7 
<211> 1604 
<212> DNA 

<213> Ostrinia nubilalis 
<400> 7 

tccgaattct tcttcaacct catcgacaac ttcttttctg acggtgacgg taggagaaac 60 

caggacgaag ttgaaatatt tgtcgttcta ttggatgtga acgacaacgc tcctgagatg 120 

ccatcgcctg atgaactccg gtttgatgtt tccgaaggag cagttgctgg tgtccgtgta 180 

ctcccagaaa tctacgcacc tgacagggat gaaccagaca cggacaactc gcgtgtcggt 240 

tacggaatcc tggacctcac gatcaccgac cgagacatcg aggtgccgga tctcttcacc 300 

atgatctcga ttgaaaacaa aactggggaa cttgagaccg ctatggactt gagggggtat 360 
tggggcactt acgaaatatt cattgaggcc ttcgaccacg gctacccgca gcagaggtcc - 420 

aacgggacgt acacactggt cattcgcccc tacaacttcc accaccctgt gttcgtgttc 4 80 

ccgcaacccg actccgtcat tcggctctct agggagcgcg caacagaagg cggggtcctg 540 

gcgacggctg ccaacgagtt cctggagccg atctacgcca ccgacgagga cggcctccac 600 

gcgggcagcg tcacgttcca cgtccaggga aatgaggagg ccgttcagta ctttgatata 660 

actgaagtgg gagcaggaga aaatagcggg cagcttatat tacgccagct tttcccagag 720 

caaatcagac aattcaggat cacgatccgg gccacagacg gcggcacgga gcccggcccg 780 

ctttggaccg acgtcacgtt ttcggtggtc ttcgtaccca cgcagggcga cccagtgttc 840 

agcgaaaatg cagctactgt tgccttcttc gagggtgaag aaggcctcca tgagagtttt 900 

gagctgccgc aagcagaaga ccttaaaaac cacctctgcg aagatgactg ccaagatatc 960 

tactacaggt ttattgacgg caacaacgag ggtctgttcg tgctggacca gtcgagcaac 1020 

gtcatctccc ttgcgcagga gttggaccgc gaggttgcca cgtcttacac gctgcacatc 1080 

gcggcgagca actcgcccga cgccactggg atccctctgc agacttccat cctcgttgtc 1140 

acggtcaatg taagagaagc gaacccgcgc ccaattttcg agcaggacct ttacacagcg 1200 

ggcatttcga cgttggacag cattggccgg gaattgctta ccgtcagggc gagccacaca 1260 

gaagacgaca ccatcacgta catcatagac cgtgcgagca tgcagctgga cagcagccta 1320 

gaagccgtgc gcgactcggc cttcacgctg catgcgacca ccggcgtgct ttcgctcaat 1380 

atgcagccca ccgcttccat gcacggcatg ttcgagttcg acgtcatcgc tacggataca 14 40 

gcatctgcaa tcgacacagc tcgtgtgaaa gtctacctca tctcatcgca aaaccgcgtg 1500 

tccttcattt tcgataacca acttgagacc gttgagcaga acagaaattt catagcggcc 1560 

acgttcagca ccgggttcaa catgacgtgt aacatcgacc aagt 1604 

<210> 8 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic oligonucleotide 
<400> 8 

gttamygtga gagaggcaga ycc 23 

<210> 9 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic" oligonucleotide 
<400> 9 

ggatrttaag mgtcagyacw ccg 23 
<210> 10 
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<211> 32 

<212> DNA . - 

<213> Artificial Sequence 

<220> 

<223> Synthetic oligonucleotide 
<400> 10 

tccgaattct tcttyaacct catcgayaac tt 32 

<210> 11 
<211> 32 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic oligonucleotide 
<400> 11 

cgcaagctta cttggtcgat gttrcasgtc at 32 
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